PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/82, 9A0, 15A1, C08B 30/04 



Al 



(11) International Publication Number: WO 98/37213 

(43) International Publication Date: 27 August 1998 (27.08.98) 



(21) International Application Number: PCT/IB98/00270 

(22) International Piling Date: 23 February 1998 (23.02.98) 



(30) Priority Data: 

9703663.6 
9706060.2 



21 February 1997 (21.02.97) GB 
24 March 1997 (24.03.97) GB 



(71) Applicant (for all designated States except US): DANISCO 

A/S [DK/DK]; Langebrogade 1, P.O. Box 17, DK-1001 
Copenhagen K (DK). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): POULSEN, Peter [DK/DK]; 
Danisco a/s, Langebrogade 1, P.O. Box 17, DK-1001 
Copenhagen K (DK). 

(74) Agents: MASCHIO, Antonio et al.; D Young & Co., 21 New 
Fetter Lane, London EC4A IDA (GB). 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, GM, GW, HU, ID, EL, IS, JP, KE, KG, KP, KR, KZ, 
LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, MN, MW, 
MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO 
patent (GH, GM, KE, LS, MW, SD, SZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, 
LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, 
CM, GA, GN, ML, MR, NE, SN, TD, TG). 



Published 

With international search report 



(54) Title: ANTISENSE INTRON INHIBITION OF STARCH BRANCHING ENZYME EXPRESSION 



(57) Abstract 

A method of inhibiting gene expression is 
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ANTISENSE INTRON INHIBITION OF STARCH BRANCHING ENZYME EXPRESSION 

The present invention relates to a method of inhibiting gene expression, particularly 
inhibiting gene expression in a plant. The present invention also relates to a nucleotide 
5 sequence useful in the method. In addition, the present invention relates to a promoter 
that is useful for expressing the nucleotide sequence. 

Starch is one of the main storage carbohydrates in plants, especially higher plants. 
The structure of starch consists of amylose and amylopectin. Amylose consists essentially 
of straight chains of a-l-4-linked glycosyl residues. Amylopectin comprises chains of 

10 a-l-4-linked glycosyl residues with some a-1-6 branches. The branched nature of 
amylopectin is accomplished by the action of inter alia an enzyme commonly known as 
the starch branching enzyme ("SBE"). SBE catalyses the formation of branch points in 
the amylopectin molecule by adding a- 1,4 glucans through a-l,6-glucosidic branching 
linkages. The biosynthesis of amylose and amylopectin is schematically shown in Figure 

15 1, whereas the a- 1-4-1 inks and the a-1-6 links are shown in Figure 2. 

In Potato, it is known that two classes of SBE exist. In our copending 
international patent applications PCT/EP96/03052 and PCT/EP96/03053, class B potato 
SBE and a gene encoding it are discussed. In international patent application 
W096/34968, class A potato SBE and a cDNA encoding it are disclosed. 

20 It is known that starch is an important raw material. Starch is widely used in the 

food, paper, and chemical industries. However, a large fraction of the starches used in 
these industrial applications are post-harvest modified by chemical, physical or enzymatic 
methods in order to obtain starches with certain required functional properties. 

Within the past few years it has become desirable to make genetically modified 

25 plants which could be capable of producing modified starches which could be the same as 
the post-harvest modified starches. It is also known that it may be possible to prepare 
such genetically modified plants by expression of antisense nucleotide coding sequences. 
In this regard, June Bourque provides a detailed summary of antisense strategies for the 
genetic manipulations in plants (Bourque 1995 Plant Science 1Q1 PP 125-149). At this 

30 stage, reference could be made to Figure 3 which is a schematic diagram of one of the 
proposed mechanisms of antisense-RNA inhibition. 
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In particular, WO 92/11375 reports on a method of genetically modifying potato 
so as to form amylose-type starch. The method involves the use of an anti-sense 
construct that can apparently inhibit, to a varying extent, the expression of the gene 
coding for formation of the branching enzyme in potato. The antisense construct of WO 
5 92/1 1375 consists of a tuber specific promoter, a transcription start sequence and the first 
exon of the branching enzyme in antisense direction. However, WO 92/11375 does not 
provide any antisense sequence data. In addition, WO 92/11375 only discloses the use of 

the potato GBSS promoter. 

WO 92/14827 reports on a plasmid that, after insertion into the genome of a plant, 
10 can apparently cause changes in the carbohydrate concentration and carbohydrate 
composition, such as the concentration and composition of amylose and amylopectin, in 
the regenerated plant. The plasmid contains pan of the coding sequence of a branching 
enzyme in an antisense orientation. 

EP-A-0647715 reports on the use of antisense endogenous mRNA coding DNA to 
15 alter the characteristics and the metabolic pathways of ornamental plants. 

EP-A-0467349 reports on the expression of sequences that are antisense to 
sequences upstream of a promoter to control gene expression. 

EP-A-0458367 and US-A-5 107065 report on the expression of a nucleotide 
sequence to regulate gene expression in a plant. The nucleotide sequence is 
20 complementary to a mRNA sequence of a gene and may cover all or a portion of the non- 
coding region of the gene. In other words, the nucleotide sequences of EP-A-0458367 
and US-A-5107065 must at least comprise a sequence that is complementary to a coding 
region. EP-A-0458367 and US-A-5107065 contain minimal sequence information. 

W096/34968 discusses the use of antisense sequences complementary to 
25 sequences which encode class A and class B potato SBE to downregulate SBE expression 
in potato plants. The sequences used are complementary to SBE coding sequences. 

Kuipers et al in Mol. Gen. Genet. [1995] 246 745-755 report on the expression of 
a series of nucleotides that are antisense to part of the genomic intron sequences of potato 
granule bound starch synthetase. Here the antisense intron sequences are attached to a 
30 part of the antisense exon sequences - wherein the intron sequences and the exon 
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sequences are naturally associated with each other. In addition, the expressed antisense 
intron sequences are at most 231 bp in length. 

Likewise, Kull et al in J. Genet & Breed. [1995] 42 69-76 report on the 
expression of a series of nucleotides that are antisense to part of the genomic intron 
5 sequences of potato granule bound starch synthetase. Likewise, here the antisense intron 
sequences are attached to a part of the antisense exon sequences - wherein the intron 
sequences and the exon sequences are naturally associated with each other. In addition, 
likewise, the expressed antisense intron sequences are at most 231 bp in length. 

Shimada et al in Theor. Appl, Genet. [1993] 36 665-672 report on the expression 
10 of a series of nucleotides that are antisense to part of the genomic intron sequences of rice 
granule bound starch synthetase. Here the antisense intron sequences are attached to a 
part of the antisense exon sequences - wherein the intron sequences and the exon 
sequences are naturally associated with each other. In addition, the expressed antisense 
intron sequences are less than 350 bp in length. 
15 Reviews on how enzymatic activity can be affected by expression of particular 

nucleotide sequences may be found in the teachings of Finnegan and McElroy [1994] 
Biotechnology 12 883-888; and Matzke and Matzke [1995] TIG 11 1-3. 

Whilst it is known that enzymatic activity can be affected by expression of 
particular nucleotide sequences there is still a need for a method that can more reliably 
20 and/or more efficiently and/or more specifically affect enzymatic activity. 

According to a first aspect of the present invention there is provided a method of 
affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising 
expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence 
wherein the nucleotide sequence partially or completely codes for (is) an intron of the 
25 potato class A SBE gene in an antisense orientation optionally together with a nucleotide 
sequence which codes, partially or completely, for an intron of a class B starch branching 
enzyme in an antisense or sense orientation; and wherein the nucleotide sequence does not 
contain a sequence that is antisense to an exon sequence normally associated with the 
intron. 

30 According to a second aspect of the present invention there is provided a method 

of affecting enzymatic activity in a starch producing organism (or a cell, a tissue or an 
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organ thereof) comprising expressing in the starch producing organism (or a cell, a tissue 
or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, 
partially or completely, for an intron of the potato class A SBE gene, in an antisense 
orientation optionally together with a nucleotide sequence which codes, partially or 
completely, for an intron of a class B starch branching enzyme in an antisense or sense 
orientation; and wherein starch branching enzyme activity is affected and/or the levels of 
amylopectin are affected and/or the composition of starch is changed. 

Preferably, the class A SBE gene antisense intron construct is used in combination 
with a potato class B SBE gene antisense intron construct as defined in PCT/EP96/03052. 
However, it may also be used independently thereof, to target class A SBE alone, or in 
combination with other transgenes, to further manipulate starch quality in potato plants. 

According to a third aspect of the present invention, therefore, there is provided 
an antisense sequence comprising the nucleotide sequence shown as any one of SEQ.I.D. 
No. 15 to SEQ I.D. No. 27 and the complement of SEQ. ID. No.38, or a variant, 
derivative or homologue thereof. 

According to a fourth aspect of the present invention there is provided a promoter 
comprising the sequence shown as SEQ.I.D. No. 14 or a variant, derivative or 
homologue thereof. 

According to a fifth aspect of the present invention there is provided a construct 
capable of comprising or expressing the present invention. 

According to a sixth aspect of the present invention there is provided a vector 
comprising or expressing the present invention. 

According to a seventh aspect of the present invention there is provided a cell, 
tissue or organ comprising or expressing the present invention. 

According to an eighth aspect of the present invention there is provided a 
transgenic starch producing organism comprising or expressing the present invention. 

According to a ninth aspect of the present invention there is provided a starch 
obtained from the present invention. 

According to a tenth aspect of the present invention there is provided pSS17 and 

P SS18. 
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According to an eleventh aspect of the present invention there is provided a 
nucleotide sequence that is antisense to any one or more of the intron sequences 
obtainable from class A SBE, and especially those obtainable from intron lof class A SBE 
as set forth in SEQ. ID. No. 38. 
5 A key advantage of the present invention is that it provides a method for preparing 

modified starches that is not dependent on the need for post-harvest modification of 
starches. Thus the method of the present invention obviates the need for the use of 
hazardous chemicals that are normally used in the post-harvest modification of starches. 
In addition, the present invention provides inter alia genetically modified plants 
10 which are capable of producing modified and/or novel and/or improved starches whose 
properties would satisfy various industrial requirements. 

Thus, the present invention provides a method of preparing tailor-made starches in 
plants which could replace the post-harvest modified starches. 

Also, the present invention provides a method that enables modified starches to be 
15 prepared by a method that can have a more beneficial effect on the environment than the 
known post-harvest modification methods which are dependent on the use of hazardous 
chemicals and large quantities of energy. 

An other key advantage of the present invention is that it provides a method that 
may more reliably and/or more efficiently and/or more specifically affect enzymatic 
20 activity when compared to the known methods of affecting enzymatic activity. With 
regard to this advantage of the present invention it is to be noted that there is some degree 
ofhomology between coding regions of SBEs. However, there is little or no homology 
with the intron sequences of SBEs. 

Thus, antisense intron expression provides a mechanism to affect selectively the 
25 expression of a particular class A SBE. This advantageous aspect could be used, for 
example, to reduce or eliminate a particular SBE enzyme, especially a class A SBE 
enzyme, and replace that enzyme with another enzyme which can be another branching 
enzyme or even a recombinant version of the affected enzyme or even a hybrid enzyme 
which could for example comprise part of a SBE enzyme from one source and at least a 
part of another SBE enzyme from another source. This particular feature of the present 



30 
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invention is covered by the combination aspect of the present invention which is discussed 
in more detail later. 

Thus the present invention provides a mechanism for selectively affecting class A 
SBE activity. This is in contrast to the prior art methods which are dependent on the use 
of for example antisense exon expression whereby it would not be possible to introduce 
new SBE activity without affecting that activity as well. 

In the context of the present invention, class B SBE is synonymous with SBE I: 
class A SBE is synonymous with SBE II. Class A SBE is as defined in W096/34968, 
incorporated herein by reference. Preferably, the antisense intron construct used 
comprises intron 1 of class A SBE, which is 2.0 kb in length and is located starting at 
residue 45 of the coding sequence of class A SBE. The boundaries of the intron may be 
calculated by searching for consensus intron boundary sequences, and are shown in 
attached figure 13. Class B SBE is substantially as defined in the sequences given herein 
and in PCT/EP96/03052. 

Preferably with the first aspect of the present invention starch branching enzyme 
activity is affected and/or the levels of amylopectin are affected and/or the composition of 
starch is changed. 

Preferably with the second aspect of the present invention the nucleotide sequence 
does not contain a sequence that is antisense to an exon sequence normally associated with 
the intron. 

Preferably with the fourth aspect of the present invention the promoter is in 
combination with a gene of interest ("GOI"). 

Preferably the enzymatic activity is reduced or eliminated. 

Preferably the nucleotide sequence codes for at least substantially all of at least 
one intron in an antisense orientation. 

Preferably the nucleotide sequence codes, partially or completely, for two or more 
introns and wherein each intron is in an anti-sense orientation. 

Preferably the nucleotide sequence comprises at least 350 nucleotides (e.g. at least 
350 bp), more preferably at least 500 nucleotides (e.g. at least 500 bp). 

Preferably the nucleotide sequence comprises the complement of the sequence 
shown in SEQ. ID. No. 38, or a fragment thereof. 
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Preferably the nucleotide sequence is expressed by a promoter having a sequence 
shown as SEQ. I.D. No 14 or a variant, derivative or homologue thereof. 
Preferably the transgenic starch producing organism is a plant. 
A preferred aspect of the present invention therefore relates to a method of 
5 affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising 
expressing, in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence 
wherein the nucleotide sequence codes, partially or completely, for an intron in an 
antisense orientation; wherein the nucleotide sequence does not contain a sequence that is 
antisense to an exon sequence normally associated with the intron; and wherein starch 
10 branching enzyme activity is affected and/or the levels of amylopectin are affected and/or 
the composition of starch is changed. 

A more preferred aspect of the present invention therefore relates to a method of 
affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising 
expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence 
15 wherein the nucleotide sequence codes, partially or completely, for an intron in an 
antisense orientation; wherein the nucleotide sequence does not contain a sequence that is 
antisense to an exon sequence normally associated with the intron; wherein starch 
branching enzyme activity is affected and/or the levels of amylopectin are affected and/or 
the composition of starch is changed; and wherein the nucleotide sequence comprises the 
sequence shown as any one of SEQ.I.D. No. 15 to SEQ.I.D. No. 27 or a variant, 
derivative or homologue thereof, including combinations thereof. 

The term "nucleotide" in relation to the present invention includes DNA and 
RNA. Preferably it means DNA, more preferably DNA prepared by use of recombinant 
DNA techniques. 

The term "intron" is used in its normal sense as meaning a segment of 
nucleotides, usually DNA, that is transcribed but does not encode part or all of an 
expressed protein or enzyme. 

The term "exon" is used in its normal sense as meaning a segment of nucleotides, 
usually DNA, encoding part or all of an expressed protein or enzyme. 

Thus, the term "intron" refers to gene regions that are transcribed into RNA 
molecules, but which are spliced out of the RNA before the RNA is translated into a 



20 
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protein. In contrast, the term "exon" refers to gene regions that are transcribed into RNA 
and subsequently translated into proteins. 

The terms "variant" or "homologue" or "fragment" in relation to the nucleotide 
sequence of the present invention include any substitution of, variation of, modification 

5 of, replacement of, deletion of or addition of one (or more) nucleic acid from or to the 
respective nucleotide sequence providing the resultant nucleotide sequence can affect 
enzyme activity in a plant, or cell or tissue thereof, preferably wherein the resultant 
nucleotide sequence has at least the same effect as the complement of the sequence shown 
as SEQ.I.D. No. 38. In particular, the term "homologue" covers homology with respect 

10 to similarity of structure and/or similarity of function providing the resultant nucleotide 
sequence has the ability to affect enzymatic activity in accordance with the present 
invention. With respect to sequence homology (i.e. similarity), preferably there is more 
than 80% homology, more preferably at least 85% homology, more preferably at least 
90% homology, even more preferably at least 95% homology, more preferably at least 

15 98% homology. The above terms are also synonymous with allelic variations of the 
sequences. 

Likewise, the terms "variant" or "homologue" or "fragment" in relation to the 
promoter of the present invention include any substitution of, variation of, modification 
of, replacement of, deletion of or addition of one (or more) nucleic acid from or to the 

20 respective promoter sequence providing the resultant promoter sequence allows 
expression of a GOI, preferably wherein the resultant promoter sequence has at least the 
same effect as SEQ.I.D. No. 14. In particular, the term "homologue" covers homology 
with respect to similarity of structure and/or similarity of function providing the resultant 
promoter sequence has the ability to allow for expression of a GOI, such as a nucleotide 

25 sequence according to the present invention. With respect to sequence homology (i.e. 
similarity), preferably there is more than 80% homology, more preferably at least 85% 
homology, more preferably at least 90% homology, even more preferably at least 95% 
homology, more preferably at least 98% homology. The above terms are also 
synonymous with allelic variations of the sequences. 
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The term "antisense" means a nucleotide sequence that is complementary to, and 
can therefore hybridise with, any one or all of the intron sequences of the present 
invention, including partial sequences thereof. 

With the present invention, the antisense intron can be complementary to an entire 
5 intron of the gene to be inhibited. However, in some circumstances, partial antisense 
sequences may be used (i.e. sequences that are not or do not comprise the full 
complementary sequence) providing the partial sequences affect enzymatic activity. 
Suitable examples of partial sequences include sequences that are shorter than the full 
complement of SEQ. ID. No. 38 but which comprise nucleotides that are at least 
10 antisense to the sense intron sequences adjacent the respective exon or exons. 

With regard to the second aspect of the present invention (i.e. specifically 
affecting SBE activity), the nucleotide sequences of the present invention may comprise 
one or more sense or antisense exon sequences of the SBE gene, including complete or 
partial sequences thereof, providing the nucleotide sequences can affect SBE activity, 
15 preferably wherein the nucleotide sequences reduce or eliminate SBE activity. 
Preferably, the nucleotide sequence of the second aspect of the present invention does not 
comprise an antisense exon sequence. 

The term "vector" includes an expression vector and a transformation vector. The 
term "expression vector" means a construct capable of in vivo or in vitro expression. The 
20 term "transformation vector" means a construct capable of being transferred from one 
species to another - such as from an E Coti plasmid to a fungus or a plant cell, or from an 
Agrobacterium to a plant cell. 

The term "construct" - which is synonymous with terms such as "conjugate", 
"cassette" and "hybrid" - in relation to the antisense nucleotide sequence aspect of the 
25 present invention includes the nucleotide sequence according to the present invention 
directly or indirectly attached to a promoter. An example of an indirect attachment is the 
provision of a suitable spacer group such as an intron sequence, such as the SW-intron or 
the ADH intron, intermediate the promoter and the nucleotide sequence of the present 
invention. The same is true for the term "fused" in relation to the present invention 
30 which includes direct or indirect attachment. The terms do not cover the natural 
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combination of the wild type SBE gene when associated with the wild type SBE gene 
promoter in their natural environment. 

The construct may even contain or express a marker which allows for the 
selection of the genetic construct in, for example, a plant cell into which it has been 
5 transferred. Various markers exist which may be used in, for example, plants - such as 
mannose. Other examples of markers include those that provide for antibiotic resistance - 
e.g. resistance to G418, hygromycin, bleomycin, kanamycin and gentamycin. 

The construct of the present invention preferably comprises a promoter. The term 
"promoter" is used in the normal sense of the art, e.g. an RNA polymerase binding site in 
10 the Jacob-Monod theory of gene expression. Examples of suitable promoters are those 
that can direct efficient expression of the nucleotide sequence of the present invention 
and/or in a specific type of cell. Some examples of tissue specific promoters are 
disclosed in WO 92/11375. 

The promoter could additionally include conserved regions such as a Pribnow Box 
15 or a TATA box. The promoters may even contain other sequences to affect (such as to 
maintain, enhance, decrease) the levels of expression of the nucleotide sequence of the 
present invention. Suitable examples of such sequences include the SW-intron or an 
ADH intron. Other sequences include inducible elements - such as temperature, 
chemical, light or stress inducible elements. Also, suitable elements to enhance 
20 transcription or translation may be present. An example of the latter element is the TMV 
5- leader sequence (see Sleat Gene 217 [1987] 217-225; and Dawson Plant Mol. Biol. 23 
[1993] 97). 

As mentioned, the construct and/or the vector of the present invention may 
include a transcriptional initiation region which may provide for regulated or constitutive 
25 expression. Any suitable promoter may be used for the transcriptional initiation region, 
such as a tissue specific promoter. In one aspect, preferably the promoter is the patatin 
promoter or the E35S promoter. In another aspect, preferably the promoter is the SBE 
promoter. 

If, for example, the organism is a plant then the promoter can be one that affects 
30 expression of the nucleotide sequence in any one or more of seed, tuber, stem, sprout, 
root and leaf tissues, preferably tuber. By way of example, the promoter for the 
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nucleotide sequence of the present invention can be the a-Amy 1 promoter (otherwise 
known as the Amy 1 promoter, the Amy 637 promoter or the a-Amy 637 promoter) as 
described in our co-pending UK patent application No. 9421292.5 filed 21 October 1994. 
Alternatively, the promoter for the nucleotide sequence of the present invention can be the 
5 a-Amy 3 promoter (otherwise known as the Amy 3 promoter, the Amy 351 promoter or 
the a-Amy 351 promoter) as described in our co-pending UK patent application No. 
9421286.7 filed 21 October 1994. 

The present invention also encompasses the use of a promoter to express a 
nucleotide sequence according to the present invention, wherein a part of the promoter is 
10 inactivated but wherein the promoter can still function as a promoter. Partial inactivation 
of a promoter in some instances is advantageous. In particular, with the Amy 351 
promoter mentioned earlier it is possible to inactivate a part of it so that the partially 
inactivated promoter expresses the nucleotide sequence of the present invention in a more 
specific manner such as in just one specific tissue type or organ. The term "inactivated" 
15 means partial inactivation in the sense that the expression pattern of the promoter is 
modified but wherein the partially inactivated promoter still functions as a promoter. 
However, as mentioned above, the modified promoter is capable of expressing a gene 
coding for the enzyme of the present invention in at least one (but not all) specific tissue 
of the original promoter. Examples of partial inactivation include altering the folding 
20 pattern of the promoter sequence, or binding species to parts of the nucleotide sequence, 
so that a part of the nucleotide sequence is not recognised by, for example, RNA 
polymerase. Another, and preferable, way of partially inactivating the promoter is to 
truncate it to form fragments thereof. Another way would be to mutate at least a part of 
the sequence so that the RNA polymerase can not bind to that part or another part. 
25 Another modification is to mutate the binding sites for regulatory proteins for example the 
CreA protein known from filamentous fungi to exert carbon catabolite repression, and 
thus abolish the catabolite repression of the native promoter. 

The construct and/or the vector of the present invention may include a 
transcriptional termination region. 
30 The nucleotide according to the present invention can be expressed in combination 

(but not necessarily at the same time) with an additional construct. Thus the present 
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invention also provides a combination of constructs comprising a first construct 
comprising the nucleotide sequence according to the present invention operatively linked 
to a first promoter; and a second construct comprising a GOI operatively linked to a 
second promoter (which need not be the same as the first promoter). With this aspect of 
the present invention the combination of constructs may be present in the same vector, 
plasmid, cells, tissue, organ or organism. This aspect of the present invention also covers 
methods of expressing the same, preferably in specific cells or tissues, such as expression 
in just a specific cell or tissue, of an organism, typically a plant. With this aspect of the 
present invention the second construct does not cover the natural combination of the gene 
coding for an enzyme ordinarily associated with the wild type gene promoter when they 
are both in their natural environment. 

An example of a suitable combination would be a first construct comprising the nucleotide 
sequence of the present invention and a promoter, such as the promoter of the present 
invention, and a second construct comprising a promoter, such as the promoter of the 
present invention, and a GOI wherein the GOI codes for another starch branching enzyme 
either in sense or antisense orientation. 

The above comments relating to the term "construct" for the antisense nucleotide 
aspect of the present invention are equally applicable to the term "construct" for the 
promoter aspect of the present invention. In this regard, the term includes the promoter 
according to the present invention directly or indirectly attached to a GOI. 

The term "GOI" with reference to the promoter aspect of the present invention or 
the combination aspect of the present invention means any gene of interest, which need 
not necessarily code for a protein or an enzyme - as is explained later. A GOI can be any 
nucleotide sequence that is either foreign or natural to the organism in question, for 
example a plant. 

Typical examples of a GOI include genes encoding for other proteins or enzymes 
that modify metabolic and catabolic processes. The GOI may code for an agent for 
introducing or increasing pathogen resistance. 
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The GOI may even be an antisense construct for modifying the expression of 
natural transcripts present in the relevant tissues. An example of such a GOI is the 
nucleotide sequence according to the present invention. 

The GOI may even code for a protein that is non-natural to the host organism - 

5 e.g. a plant. The GOI may code for a compound that is of benefit to animals or humans. 
For example, the GOI could code for a pharmaceutically active protein or enzyme such as 
any one of the therapeutic compounds insulin, interferon, human serum albumin, human 
growth factor and blood clotting factors. The GOI may even code for a protein giving 
additional nutritional value to a food or feed or crop. Typical examples include plant 

10 proteins that can inhibit the formation of anti-nutritive factors and plant proteins that have 
a more desirable amino acid composition (e.g. a higher lysine content than a non- 
transgenic plant). The GOI may even code for an enzyme that can be used in food 
processing such as xylanases and a-galactosidase. The GOI can be a gene encoding for 
any one of a pest toxin, an antisense transcript such as that for cc-amylase, a protease or a 

15 glucanase. Alternatively, the GOI can be a nucleotide sequence according to the present 
invention. 

The GOI can be the nucleotide sequence coding for the arabinofuranosidase 
enzyme which is the subject of our co-pending UK patent application 9505479.7. The 
GOI can be the nucleotide sequence coding for the glucanase enzyme which is the subject 
20 of our co-pending UK patent application 9505475.5. The GOI can be the nucleotide 
sequence coding for the a-amylase enzyme which is the subject of our co-pending UK 
patent application 9413439.2. The GOI can be the nucleotide sequence coding for the a- 
amylase enzyme which is the subject of our co-pending UK patent application 9421290.9. 
The GOI can be any of the nucleotide sequences coding for the a-glucan lyase enzyme 
25 which are described in our co-pending PCT patent application PCT/EP94/03397 . 

In one aspect the GOI can even be a nucleotide sequence according to the present 
invention but when operatively linked to a different promoter. 

The GOI could include a sequence that codes for one or more of a xylanase, an 
arabinase, an acetyl esterase, a rhamnogalacturonase, a glucanase, a pectinase, a 
30 branching enzyme or another carbohydrate modifying enzyme or proteinase. 
Alternatively, the GOI may be a sequence that is antisense to any of those sequences. 
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As mentioned above, the present invention provides a mechanism for selectively 
affecting a particular enzymatic activity. In an important application of the present 
invention it is now possible to reduce or eliminate expression of a genomic nucleotide 
sequence coding for a genomic protein or enzyme by expressing an antisense intron 
construct for that particular genomic protein or enzyme and (e.g. at the same time) 
expressing a recombinant version of that enzyme or protein - in other words the GOI is a 
recombinant nucleotide sequence coding for the genomic enzyme or protein. This 
application allows expression of desired recombinant enzymes and proteins in the absence 
of (or reduced levels of) respective genomic enzymes and proteins. Thus the desired 
recombinant enzymes and proteins can be easily separated and purified from the host 
organism. This particular aspect of the present invention is very advantageous over the 
prior art methods which, for example, rely on the use of anti-sense exon expression which 
methods also affect expression of the recombinant enzyme. 

Thus, a further aspect of the present invention relates to a method of expressing a 
recombinant protein or enzyme in a host organism comprising expressing a nucleotide 
sequence coding for the recombinant protein or enzyme; and expressing a further 
nucleotide sequence wherein the further nucleotide sequence codes, partially or 
completely, for an intron in an antisense orientation; wherein the intron is an intron 
normally associated with the genomic gene encoding a protein or an enzyme 
corresponding to the recombinant protein or enzyme; and wherein the further nucleotide 
sequence does not contain a sequence that is antisense to an exon sequence normally 
associated with the intron. Additional aspects cover the combination of those nucleotide 
sequences including their incorporation in constructs, vectors, cells, tissues and transgenic 
organisms. 

Therefore the present invention also relates to a combination of nucleotide 
sequences comprising a first nucleotide sequence coding for a recombinant enzyme; and a 
second nucleotide sequence which corresponds to an intron in antisense orientation; 
wherein the intron is an intron that is associated with a genomic gene encoding an enzyme 
corresponding to the recombinant enzyme; and wherein the second nucleotide sequence 
does not contain a sequence that is antisense to an exon sequence normally associated with 
the intron. 
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The GOI may even code for one or more introns, such as any one or more of the 
intron sequences presented in the attached sequence listings. For example, the present 
invention also covers the expression of for example an antisense intron (e.g. the 
complement of SEQ. ID. No. 38) in combination with for example a sense intron which 
5 preferably is not complementary to the antisense intron sequence (e.g. SEQ.I.D.No. 2 or 
another class A SBE intron). 

The terms "cell", "tissue" and "organ" include cell, tissue and organ per se and 

when within an organism. 

The term "organism" in relation to the present invention includes any organism 
10 that could comprise the nucleotide sequence according to the present invention and/or 
wherein the nucleotide sequence according to the present invention can be expressed when 
present in the organism. Preferably the organism is a starch producing organism such as 
any one of a plant, algae, fungi, yeast and bacteria, as well as cell lines thereof. 
Preferably the organism is a plant. 
15 The term "starch producing organism" includes any organism that can 

biosynthesise starch. Preferably, the starch producing organism is a plant. 

The term "plant" as used herein includes any suitable angiosperm, gymnosperm, 
monocotyledon and dicotyledon. Typical examples of suitable plants include vegetables 
such as potatoes; cereals such as wheat, maize, and barley; fruit; trees; flowers; and other 
20 plant crops. Preferably, the term means "potato". 

The term "transgenic organism" in relation to the present invention includes any 
organism that comprises the nucleotide sequence according to the present invention and/or 
products obtained therefrom, and/or wherein the nucleotide sequence according to the 
present invention can be expressed within the organism. Preferably the nucleotide 
25 sequence of the present invention is incorporated in the genome of the organism. 
Preferably the transgenic organism is a plant, more preferably a potato. 

To prepare the host organism one can use prokaryotic or eukaryotic organisms. 
Examples of suitable prokaryotic hosts include E. coli and Bacillus subtilis. Teachings on 
the transformation of prokaryotic hosts is well documented in the art, for example see 
30 Sambrook et al (Sambrook et al. in Molecular Cloning: A Laboratory Manual, 2nd 
edition. 1989, Cold Spring Harbor Laboratory Press). 



10 
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Even though the enzyme according to the present invention and the nucleotide 
sequence coding for same are not disclosed in EP-B-0470145 and CA-A-2006454, those 
two documents do provide some useful background commentary on the types of 
techniques that may be employed to prepare transgenic plants according to the present 
invention. Some of these background teachings are now included in the following 
commentary. 

The basic principle in the construction of genetically modified plants is to insert 
genetic information in the plant genome so as to obtain a stable maintenance of the 

inserted genetic material. 

Several techniques exist for inserting the genetic information, the two main 
principles being direct introduction of the genetic information and introduction of the 
genetic information by use of a vector system. A review of the general techniques may 
be found in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205- 
225) and Christou (Agro-Food-Industry Hi-Tech March/April 1994 17-27). 
15 Thus, in one aspect, the present invention relates to a vector system which carries 

a nucleotide sequence or construct according to the present invention and which is capable 
of introducing the nucleotide sequence or construct into the genome of an organism, such 
as a plant. 

The vector system may comprise one vector, but it can comprise two vectors. In 
the case of two vectors, the vector system is normally referred to as a binary vector 
system. Binary vector systems are described in further detail in Gynheung An et al. 
(1980), Binary Vectors, Plant Molecular Biology Manual A3, 1-19. 

One extensively employed system for transformation of plant cells with a given 
promoter or nucleotide sequence or construct is based on the use of a Ti plasmid from 
25 Agrobacterium tumefaciens or a Ri plasmid from Agrobacterium rhizogenes An et al. 
(1986), Plant Physiol. 81, 301-305 and Butcher D.N. et al. (1980), Tissue Culture 
Methods for Plant Pathologists, eds.: D.S. Ingrams and J.P. Helgeson, 203-208. Several 
different Ti and Ri plasmids have been constructed which are suitable for the construction 
of the plant or plant cell constructs described above. A non-limiting example of such a Ti 
30 plasmid is pGV3850. 



20 
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The nucleotide sequence or construct of the present invention should preferably be 
inserted into the Ti-plasmid between the terminal sequences of the T-DNA or adjacent a 
T-DNA sequence so as to avoid disruption of the sequences immediately surrounding the 
T-DNA borders, as at least one of these regions appears to be essential for insertion of 
5 modified T-DNA into the plant genome. 

As will be understood from the above explanation, if the organism is a plant the 
vector system of the present invention is preferably one which contains the sequences 
necessary to infect the plant (e.g. the vir region) and at least one border part of a T-DNA 
sequence, the border part being located on the same vector as the genetic construct. 
10 Furthermore, the vector system is preferably an Agrobacterium tumefaciens Ti- 

plasmid or an Agrobacterium rhizogenes Ri-plasmid or a derivative thereof. As these 
plasmids are well-known and widely employed in the construction of transgenic plants, 
many vector systems exist which are based on these plasmids or derivatives thereof. 

In the construction of a transgenic plant the nucleotide sequence or construct of 
15 the present invention may be first constructed in a microorganism in which the vector can 
replicate and which is easy to manipulate before insertion into the plant. An example of a 
useful microorganism is E. coli, but other microorganisms having the above properties 
may be used. When a vector of a vector system as defined above has been constructed in 
E. coli, it is transferred, if necessary, into a suitable Agrobacterium strain, e.g. 
20 Agrobacterium tumefaciens. The Ti-plasmid harbouring the nucleotide sequence or 
construct of the present invention is thus preferably transferred into a suitable 
Agrobacterium strain, e.g. A. tumefaciens, so as to obtain an Agrobacterium cell 
harbouring the promoter or nucleotide sequence or construct of the present invention, 
which DNA is subsequently transferred into the plant cell to be modified. 
25 If, for example, for the transformation the Ti- or Ri-plasmid of the plant cells is 

used, at least the right boundary and often however the right and the left boundary of the 
Ti- and Ri-plasmid T-DNA, as flanking areas of the introduced genes, can be connected. 
The use of T-DNA for the transformation of plant cells has been intensively studied and is 
described in EP-A-120516; Hoekema, in: The Binary Plant Vector System Offset- 
30 drukkerij Ranters B.B., Alblasserdam, 1985, Chapter V; Fraley, et al., Crit. Rev. Plant 
Sci., 4:1-46: and An et al., EMBO J. (1985) 4:277-284. 
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Direct infection of plant tissues by Agrobacterium is a simple technique which has 
been widely employed and which is described in Butcher D.N. et al. (1980), Tissue 
Culture Methods for Plant Pathologists, eds.: D.S. Ingrams and J.P. Helgeson, 203-208. 
For further teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant Mol Biol 
[1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/April 1994 17- 
27). With this technique, infection of a plant may be performed in or on a certain part or 
tissue of the plant, i.e. on a part of a leaf, a root, a stem or another part of the plant. 

Typically, with direct infection of plant tissues by Agrobacterium carrying the 
GOI (such as the nucleotide sequence according to the present invention) and, optionally, 
a promoter, a plant to be infected is wounded, e.g. by cutting the plant with a razor blade 
or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound is 
then inoculated with the Agrobacterium. The inoculated plant or plant part is then grown 
on a suitable culture medium and allowed to develop into mature plants. 

When plant cells are constructed, these cells may be grown and maintained in 
accordance with well-known tissue culturing methods such as by culturing the cells in a 
suitable culture medium supplied with the necessary growth factors such as amino acids, 
plant hormones, vitamins, etc. 

Regeneration of the transformed cells into genetically modified plants may be 
accomplished using known methods for the regeneration of plants from cell or tissue 
cultures, for example by selecting transformed shoots using an antibiotic and by 
subculturing the shoots on a medium containing the appropriate nutrients, plant 
hormones, etc. 

Further teachings on plant transformation may be found in EP-A-0449375. 

As reported in CA-A-2006454, a large amount of cloning vectors are available 
which contain a replication system in E. coli and a marker which allows a selection of the 
transformed cells. The vectors contain for example pBR 322, pUC series, M13 mp 
series, pACYC 184 etc. In this way, the nucleotide or construct of the present invention 
can be introduced into a suitable restriction position in the vector. The contained plasmid 
is then used for the transformation in E.coli. The E.coli cells are cultivated in a suitable 
nutrient medium and then harvested and lysed. The plasmid is then recovered. As a 
method of analysis there is generally used sequence analysis, restriction analysis, 
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electrophoresis and further biochemical-molecular biological methods. After each 
manipulation, the used DNA sequence can be restricted and connected with the next DNA 
sequence. Each sequence can be cloned in the same or different plasmid. 

After the introduction of the nucleotide sequence or construct according to the 
5 present invention in the plants the presence and/or insertion of further DNA sequences 
may be necessary - such as to create combination systems as outlined above (e.g. an 
organism comprising a combination of constructs). 

The above commentary for the transformation of prokaryotic organisms and plants 
with the nucleotide sequence of the present invention is equally applicable for the 
10 transformation of those organisms with the promoter of the present invention. 

In summation, the present invention relates to affecting enzyme activity by 
expressing antisense intron sequences. 

Also, the present invention relates to a promoter useful for the expression of those 

antisense intron sequences. 
15 The following samples have been deposited in accordance with the Budapest 

Treaty at the recognised depositary The National Collections of Industrial and Marine 
Bacteria Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scotland, AB2 1RY, 
United Kingdom, on 13 July 1995: 

NCIMB 40753 (which refers to pBEA 8 as described herein); 
20 NCIMB 40751 (which refers to X-SBE 3.2 as described herein), and 

NCIMB 40752 (which refers to X-SBE 3.4 as described herein). 

The following sample has been deposited in accordance with the Budapest Treaty 
at the recognised depositary The National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scotland, AB2 1RY, United 

25 Kingdom, on 9 July 1996: 

NCIMB 40815 (which refers to pBEA 9 as described herein). 
A highly preferred embodiment of the present invention therefore relates to a 
method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) 
comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide 
30 sequence wherein the nucleotide sequence codes, partially or completely, for an intron in 
an antisense orientation; wherein the nucleotide sequence does not contain a sequence that 
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is antisense to an exon sequence normally associated with the intron; wherein starch 
branching enzyme activity is affected and/or the levels of amylopectin are affected and/or 
the composition of starch is changed; and wherein the nucleotide sequence is antisense to 
intron 1 of class A SBE as set forth in SEQ. ID. No. 38, or any other intron of class A 

5 SBE, including fragments thereof, and including combinations of class A antisense intron 
sequences and class B antisense intron sequences. The sequence of introns of class A 
SBE other than intron 1 may be obtained by sequencing of, for example, potato class A 
SBE genomic DNA, isolatable by hybridisation screening of a genomic DNA library with 
class A SBE cDNA obtainable according to W096/34968 according to methods well 

10 known in the art and set forth, for example, in Sambrook et al. , Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor, 1989. 

The present invention will now be described only by way of example, in which 
reference is made to the following attached Figures: 

Figure 1, which is a schematic representation of the biosynthesis of amylose and 

15 amylopectin; 

Figure 2, which is a diagrammatic representation of the a-l-4-links and the a-1-6 

links of amylopectin; 

Figure 3, which is a diagrammatic representation of a possible antisense-RNA 

inhibition mechanism; 

20 Figure 4, which is a diagrammatic representation of the exon-intron structure of a 

genomic SBE clone; 

Figure 5, which is a plasmid map of pPATAl, which is 3936 bp in size; 

Figure 6, which is a plasmid map of pABE6, which is 5106 bp in size; 

Figure 7, which is a plasmid map of pVictorlV Man, which is 7080 bp in size; 
25 Figure 8, which is a plasmid map of pBEA8, which is 9.54 kb in size; 

Figure 9, which is a plasmid map of pBEA9, which is 9.54 kb in size; 

Figure 10, which is a plasmid map of pBEP2, which is 10.32 kb in size; 

Figure 1 1 , which is a plasmid map of pVictor5a, which is 9. 12 kb in size; 

Figure 12, which shows the full genomic nucleotide sequence for SBE including 
30 the promoter, exons and introns; 
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Figure 13, which shows the positioning of intron 1 in the class A and class B SBE 

genes; 

Figure 14, which shows the sequence of intron 1 of the potato class A SBE; 

Figure 15 , which shows the structure of pSS17; and 
5 Figure 16, which shows the structure of pSS18. 

Figures 1 and 2 were referred to above in the introductory description concerning 
starch in general. Figure 3 was referred to above in the introductory description 
concerning antisense expression. 

As mentioned, Figure 4 is a diagrammatic representation of the exon-intron 
10 structure of a genomic SBE clone, the sequence of which is shown in Figure 12. This 
clone, which has about 11.5 k base pairs, comprises 14 exons and 13 introns. The 
introns are numbered in increasing order from the 5' end to the 3' end and correspond to 
SEQ.LD.No.s 1-13, respectively. Their respective antisense intron sequences are shown 

as SEQ.LD.No.s 15-27. 

15 In more detail, Figures 4 and 12 present information on the 1 1478 base pairs of a 

potato SBE gene. The 5' region from nucleotides 1 to 2082 contain the promoter region 
of the SBE gene. A TATA box candidate at nucleotide 2048 to 2051 is boxed. The 
homology between a potato SBE cDNA clone (Poulsen & Kreiberg (1993) Plant Physiol 
102: 1053-1054) and the exon DNAs begin at 2083 bp and end at 9666 bp. 

20 The homology between the cDNA and the exon DNA is indicated by nucleotides 

in upper case letters, while the translated amino acid sequences are shown in the single 
letter code below the exon DNA. Intron sequences are indicated by lower case letters. 

Figures 5 to 7 are discussed below. As mentioned, Figure 8 is a plasmid map of 
P BEA8, which is 9.54 k base pairs in size; and Figure 9 is a plasmid map of pBEA9, 

25 which is 9.54 k base pairs in size. Each of pBEA 8 and pBEA 9 comprises an antisense 
sequence to the first intron sequence of the potato SBE gene. This first intron sequence, 
which has 1177 base pairs, is shown in Figure 4 and lies between the first exon and the 
second exon. 

These experiments and aspects of the present invention are now discussed in more 

30 detail. 
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ISOLATION, SUBCLONING IN PLASMIDS, AND SEQUENCING OF GENOMIC 

CLASS B SBE CLONES 
5 Various clones containing the potato class B SBE gene are isolated from a Desiree 

potato genomic library (Clontech Laboratories Inc., Palo Alto CA, USA) using 
radioactively labelled potato SBE cDNA (Poulsen & Kreiberg (1993) Plant Physiol. 
102:1053-1054) as probe. The fragments of the isolated X-phages containing SBE DNA 
(XSBE 3.2 - NCIMB 40751 - and XSBE-3.4 - NCIMB 40752) are identified by Southern 
10 analysis and then subcloned into pBluescript H vectors (Clontech Laboratories Inc., Palo 
Alto CA, USA). XSBE 3.2 contains a 15 kb potato DNA insert and XSBE-3.4 contains a 
13 kb potato DNA insert. The resultant plasmids are called pGB3, pGBU, pGB15, 
P GB16 and pGB25 (see discussion below). The respective inserts are then sequenced 
using the Pharmacia Autoread Sequencing Kit (Pharmacia, Uppsala) and a A.L.F. DNA 
15 sequencer (Pharmacia, Uppsala). 

In total, a stretch of 11.5 kb of the class B SBE gene is sequenced. The sequence 
is deduced from the above-mentioned plasmids, wherein: pGB25 contains the sequences 
from 1 bp to 836 bp, P GB15 contains the sequences from 735 bp to 2580 bp, pGB16 
contains the sequences from 2580 bp to 5093 bp, pGBU contains the sequences from 
20 3348 bp to 7975 bp, and pGB3 contains the sequences from 7533 bp to 1 1468 bp. 

In more detail, P GB3 is constructed by insertion of a 4 kb EcoRl fragment isolated 
from XSBE 3.2 into the EcoRl site of pBluescript H SK (+)• pGBU is constructed by 
insertion of a 4.7 kb Xhol fragment isolated from XSBE 3.4 into the Xhol site of 
pBluescript II SK (+). P GB15 is constructed by insertion of a 1.7 kb Spel fragment 
25 isolated from XSBE 3.4 into the Spel site of pBluescript II SK (+). P GB16 is constructed 
by insertion of a 2.5 kb Spel fragment isolated from XSBE 3.4 into the Spel site of 
pBluescript II SK (+). For the construction of pGB25 a PCR fragment is produced with 
the primers 

5' GGA ATT CCA GTC GCA GTC TAC ATT AC 3' 

(SEQ. ID. No.30) 



30 
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5" CGG GAT CCA GAG GCA TTA AGA TTT CTG G 3' 

(SEQ. ID. No. 31) 

and XSBE 3.4 as a template. 

The PCR fragment is digested with BamHl and EcoRL, and inserted in pBluescript 
5 II SK (+) digested with the same restriction enzymes. 
A class A SBE clone is derived similarly. 

CONSTRUCTION OF CLASS B SBE ANTISENSE INTRON PLASMIDS pBEA8 
and pBEA9 

10 The SBE intron 1 is amplified by PCR using the oligonucleotides: 

5' CGG GAT CCA AAG AAA TTC TCG AGG TTA CAT GG 3' 

(SEQ. ID. No. 32) 

and 

5' CGG GAT CCG GGG TAA TTT TTA CTA ATT TCA TG 3' 
15 (SEQ. ID. No. 33) 

and the XSBE 3.4 phage containing the SBE gene as template. 

The PCR product is digested with BamHl and inserted in an antisense orientation 
in the BamHl site of plasmid pPATAl (described in WO 94/24292) between the patatin 
promoter and the 35S terminator. This construction, pABE6, is digested with Kpnl, and 

20 the 2.4 kb "patatin promoter-SBE intron 1- 35S terminator" Kpnl fragment is isolated and 
inserted in the Kpnl site of the plant transformation vector pVictorlV Man. The Kpnl 
fragment is inserted in two orientations yielding plasmids pBEA8 and pBEA9. pVictorlV 
Man is shown in Figure 7 and is formed by insertion of a filled in Xbal fragment 
containing a E35S promoter-manA-35S terminator cassette isolated from plasmid 

25 pVictorlV SGiN Man (WO 94/24292) into the filled in Xhol site of pVictor IV. The 
pVictor regions of pVictor IV Man contained between the co-ordinates 2.52 bp to 0.32 bp 
(see Figure 7). 



10 
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CONSTRUCTION OF CLASS A SBE ANTISENSE INTRON PLASMIDS pSS17 
and pSS18 

Construction of plasmid pSS17. 

The 2122 bp intron 1 sequence of the potato SBEII gene is amplified by PCR 
from a genomic SBEII subclone using the primers 5' - CGG GAT CCC GTA TGT 
CTC ACT GTG TTT GTG GC - 3' (SEQ. ID. No. 34) and 5' - CGG GAT CCC CCT 
ACA TAC ATA TAT CAG ATT AG - 3' (SEQ. ID. No. 35). The PCR product is 
digested with BamHI and inserted in antisense orientation after a patatin promoter in the 
BamHI site of a plant transformation vector in which the NPTII gene is used as 
selectable marker (see figure 15). 



Construction of plasmid pSS18. 

The 2122 bp intron 1 sequence of the potato. SBEII gene is amplified by PCR 

15 from a genomic SBEII subclone using the primers 5* - CGG GAT CCC GTA TGT 
CTC ACT GTG TTT GTG GC - 3' (SEQ. ID. No. 34) and 5' - CGG GAT CCC CCT 
ACA TAC ATA TAT CAG ATT AG - 3' (SEQ. ID. No. 35). The PCR product is 
digested with BamHI and inserted in antisense orientation after a patatin promoter in the 
BamHI site of a plant transformation vector in which the manA gene is used as 

20 selectable marker (see figure 16). 

PRODUCTION OF TRANSGENIC POTATO PLANTS 

Ax p nir SlQgk cultures 

Shoot cultures of Solatium tuberosum 'Bintje' and "Dianella' are maintained on a 
25 substrate (LS) of a formula according to Linsmaier, E.U. and Skoog, F. (1965), Physiol. 
Plant. 18: 100-127, in addition containing 2 uM silver thiosulphate at 25°C and 16 h 
light/8 h dark. 

The cultures are subcultured after approximately 40 days. Leaves are then cut off 
the shoots and cut into nodal segments (approximately 0.8 cm) each containing one node. 

30 
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Tn — vlnTi^n of r ntatn tissues 

Shoots from approximately 40 days old shoot cultures (height approximately 5-6 

cms) are cut into internodal segments (approximately 0.8 cm). The segments are placed 

into liquid LS-substrate containing the transformed Agrobacterium tumefaciens containing 

5 the binary vector of interest. The Agrobacterium are grown overnight in YMB-substrate 
(di-potassium hydrogen phosphate, trihydrate (0.66 g/1); magnesium sulphate, 
heptahydrate (0.20 g/1); sodium chloride (0.10 g/1); mannitol (10.0 g/1); and yeast extract 
(0.40 g/1)) containing appropriate antibiotics (corresponding to the resistance gene of the 
Agrobacterium strain) to an optical density at 660 nm (OD-660) of approximately 0.8, 

10 centrifuged and resuspended in the LS-substrate to an OD-660 of 0.5 . 

The segments are left in the suspension of Agrobacterium for 30 minutes and then 
the excess of bacteria are removed by blotting the segments on sterile filter paper. 

rft-rnltivation 

15 The shoot segments are co-cultured with bacteria for 48 hours directly on LS- 

substrate containing agar (8.0 g/I), 2,4-dichlorophenoxyacetic acid (2.0 mg/1) and trans- 
zeatin (0.5 mg/1). The substrate and also the explants are covered with sterile filter 
papers, and the petri dishes are placed at 25°C and 16 h light/ 8 dark. 

20 "Wf^hiniT" procedure 

After the 48 h on the co-cultivation substrate the segments are transferred to 
containers containing liquid LS-substrate containing 800 mg/1 carbenicillin. The 
containers are gently shaken and by this procedure the major part of the Agrobacterium is 
either washed off the segments and/or killed. 

25 

Selection 

After the washing procedure the segments are transferred to plates containing the 
LS-substrate, agar (8 g/1), trans-zeatin. (1-5 mg/1), gibberellic acid (0.1 mg/1), 
carbenicillin (800 mg/1), and kanamycin sulphate (50-100 mg/1) or phosphinotricin (1-5 
30 mg/1) or mannose (5 g/1) depending on the vector construction used. The segments are 
sub-cultured to fresh substrate each 3-4 weeks. 
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In 3 to 4 weeks, shoots develop from the segments and the formation of new 
shoots continued for 3-4 months. 

Rooting nf regenerated shoots 
5 The regenerated shoots are transferred to rooting substrate composed of LS- 

substrate, agar (8 g/1) and carbenicillin (800 mg/1). 

The transgenic genotype of the regenerated shoot is verified by testing the rooting 
ability on the above mentioned substrates containing kanamycin sulphate (200 mg/1), by 
performing NPTII assays (Radke, S. E. et al, Theor. Appl. Genet. (1988), 75: 685-694) 
10 or by performing PCR analysis according to Wang et al (1993, NAR 21 PP 4153-4154). 
Plants which are not positive in any of these assays are discarded or used as controls. 
Alternatively, the transgenic plants could be verified by performing a GUS assay on the 
co-introduced ^-glucuronidase gene according to Hodal, L. et al. (PI. Sci. (1992), 87: 
115-122). 

15 

The newly rooted plants (height approx. 2-3 cms) are transplanted from rooting 
substrate to soil and placed in a growth chamber (21°C, 16 hour light 200-400uE/m 2 /sec). 
When the plants are well established they are transferred to the greenhouse, where they 
20 are grown until tubers had developed and the upper part of the plants are senescing. 

Harvesting 

The potatoes are harvested after about 3 months and then analysed. 

25 BRANCHING ENZYME ANALYSIS 

The class A and class B SBE expression in the transgenic potato lines is measured 
using the SBE assays described by Blennow and Johansson (Phytochemistry (1991) 
30:437-444) and by standard Western procedures using antibodies directed against potato 
SBE. 



10 
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STARCH ANALYSIS 

Starch is isolated from potato tubers and analysed for the amylose:amylopectin 
ratio (Hovenkamp-Hermelink et al. (1988) Potato Research 31:241-246). In addition, the 
chain length distribution of amylopectin is determined by analysis of isoamylase digested 

starch on a Dionex HPAEC. 

The number of reducing ends in isoamylase digested starch is determined by the 
method described by N. Nelson (1944) J. Biol.Chem. 153:375-380. 

The results reveal that there is a reduction in the level of synthesis of SBE and/or 
the level of activity of SBE and/or the composition of starch SBE in the transgenic plants. 

CONSTRUCTION OF SBE PROMOTER CONSTRUCT 

An SBE promoter fragment is amplified from X-SBE 3.4 using primers: 
5/ CCA TCG ATA CTT TAA GTG ATT TGA TGG C 3' 

(SEQ. ID. No. 36) 

15 and 

5' CGG GAT CCT GTT CTG ATT CTT GAT TTC C 3'. 

(SEQ. ID. No. 37) 

The PCR product is digested with CM and Ba,nHl. The resultant 1.2 kb fragment is then 
inserted in pVictorSa (see Figure 11) linearised with Clal and BglQ. yielding pBEP2 (see 
20 Figure 10). 

^ T A ? r H ANC P TMn PN7.YMF. MF ASUFFMFNTS OF POTATO TUBERS 

Potatoes from potato plants transformed with pBEA8, pBEA9, P SS17 or pSS18 
are cut in small pieces and homogenised in extraction buffer (50 mM Tris-HCl pH 7.5, 
25 Sodium-dithionite (0.1 g/1), and 2 mM DTT) using a Ultra-Turax homogenizer; 1 g of 
Dowex xi: is added pr. 10 g of tuber. The crude homogenate is filtered through a 
' miracloth filter and centrifuged at 4°C for 10 minutes at 24.700 g. The supernatant is 
used for starch branching enzyme assays. 

The starch branching enzyme assays are carried out at 25°C in a volume of 400 pi 
30 composed of 0.1 M Na citrate buffer pH 7.0, 0.75 mg/ml amylose, 5 mg/ml bovine 
serum albumin and the potato extract. At 0, 15, 30 and 60 minutes aliqouts of 50 pi are 
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removed from the reaction into 20 ul 3 N HC1. 1 ml of iodine solution is added and the 
decrease in absorbance at 620 nm is measured with an ELISA spectrophotometer. 

The starch branching enzyme (SBE) levels are measured in tuber extracts from 34 
transgenic Dianella potato plants transformed with plasmid pBEA8, pSS17 and pSS18. 
5 The transformed transgenic lines produce tubers which have SBE levels that are 

10% to 15% of the appropriate class A or class B SBE levels found in non transformed 
Dianella plants. 

In a further experiment, plasmids pSS17 and pBEA8 are cotransfected into potato 
plants, as described above. In the cotransfectants, when analysed as set forth above, 
10 simultaneous reduction of class A and class B SBE levels are observed. 

SUMMATION 

The above-mentioned examples relate to the isolation, sequencing and utilisation 
of.antisense intron constructs derived from a gene for potato class A and class B SBE. 

15 These SBE intron antisense constructs can be introduced into plants, such as potato 
plants. After introduction, a reduction in the level of synthesis of SBE and/or the level of 
activity of SBE and/or the composition of starch in plants can be achieved. 

Without wishing to be bound by theory it is believed that the expressed anti-sense 
nucleotide sequence of the present invention binds to sense introns on pre-mRNA and 

20 thereby prevents pre-mRNA splicing and/or subsequent translation of mRNA. This 
binding therefore is believed to reduce the level of plant enzyme activity (in particular 
class A and class B SBE activity), which in turn for SBE activity is believed to influence 
the amyloseramylopectin ratio and thus the branching pattern of amylopectin. 

Thus, the present invention provides a method wherein it is possible to manipulate 

25 the starch composition in plants, or tissues or cells thereof, such as potato tubers, by 
reducing the level of SBE activity by using an antisense-RNA technique using antisense 
intron sequences. 

The simultaneous reduction or elimination of class A and class B SBE sequences 
from the doubly transformed potato plants, moreover, offers the possibility to transform 
30 such plants with different SBE genes at will, thus allowing the manipulation of branching 
in starch according to the desired result. 
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Other modifications of the present invention will be apparent to those skilled in the 
art without departing from the scope of the present invention. 

The following pages present a number of sequence listings which have been 
consecutively numbered from SEQ.I.D. No. 1 - SEQ.I.D. No. 38. In brief, SEQ.I.D. 
No. 1 - SEQ.I.D. No. 13 represent sense intron sequences (genomic DNA); SEQ.I.D. 
No. 14 represents the SBE promoter sequence (genomic sequence); SEQ.I.D. No. 15 - 
SEQ.I.D. No. 27 represent antisense intron sequences; and SEQ. I.D. No. 28 represents 
is the sequence complementary to the SBE promoter sequence - i.e. the SBE promoter 
sequence in antisense orientation. The full genomic nucleotide sequence for class B SBE 
including the promoter, exons and introns is shown as SEQ. I.D. No. 29 and is explained 
by way of Figures 4 and 12 which highlight particular gene features. SEQ. ID. No. 30 to 
37 show primers used in the methods set forth above. SEQ. ID. No. 38 shows the 
sequence of intron 1 of class A SBE. 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: DANISCO A/S 

(B) STREET: LANGEBROGADE 1 

(C) CITY: COPENHAGEN K 
JO (E) COUNTRY: DENMARK 

(F) POSTAL CODE (ZIP) : DK-1001 

(ii) TITLE OF INVENTION: INHIBITION OF GENE EXPRESSION 
15 (iii) NUMBER OF SEQUENCES: 38 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

90 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

ZU (D ) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



25 



(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

35 

(iv) ANTI- SENSE: NO 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTAATTTTTA CTAATTTCAT GTTAATTTCA ATTATTTTTA GCCTTTGCAT TTCATTTTCC 
45 AATATATCTG GATCATCTCC TTAGTTTTTT ATTTTATTTT TTATAATATC AAATATGGAA 
GAAAAATGAC ACTTGTAGAG CCATATGTAA GTATCATGTG ACAAATTTGC AAGGTGGTTG 
AGTGTATAAA ATTCAAAAAT TGAGAGATGG AGGGGGGGTG GGGGAAGACA ATATTTAGAA 
50 AGAGTGTTCT AGGAGGTTAT GGAGGACACG GATGAGGGGT AGAAGGTTAG TTAGGTATTT 
GAGTGTTGTC TGGCTTATCC TTTCATACTA GTAGTCGTGG AATTATTTGG GTAGTTTCTT 
55 GTTTTGTTAT TTGATCTTTG TTATTCTATT TTCTGTTTCT TGTACTTCGA TTATTGTATT 
ATATATCTTG TCGTAGTTAT TGTTCCTCGG TAAGAATGCT CTAGCATGCT TCCTTTAGTG 
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15 



20 



25 



30 



35 



40 



45 



CTTTTACTTT 

AGTCCTCACC 

TATACATAAT 

ACAT AAAGGG 

TATAAGATTT 

TGAAGTTTGT 

ATGTTTTGAT 

AATTGAAAAT 

TGAGAAGCTC 

AGTCAGTCCA 

TAAGTTCTCA 



50 



55 
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TTTTATCATG CCTTCTTTAT ATTCGCGTTG CTTTGAAATG 
CTATTAGAAA CAATCTCTCT ATCTCGTAAG GTAGGGGTAA 
TGTGGGATTA CATTGTGTTT GTTGTTGTAA ATCAATTATG 
TTTACAACAC AAATACATGG TCAAGGGCAA AGTTCTGAAC 
GTCCAGGGAT ATGATAAAAA TTGTTTCTTT GTGAAAGTTA 
TTGCTGGAAA CATAATAAGT TATAATGCTG AGATAGCTAC 
CTTTTAAATG TACCAATAAT AGATTCCGTA TCGAACGAGT 
TGATGTTTCT ATTTTTTACA TTTTTTTGGT GTTGAACTGC 
ATGAGACGGA TAGTTGAGAA TGTGTTCTTT GTATGGACCT 
CCAATAATTT CTATGAATTC AAATTCAGTT TATGGCTACC 
TATGCTGCAT ATACTTGTTC AATTATACTG TAAAATTTCT 
GTAACCTCGA GAATTTCTTT GACAG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GTATGTTTGA TAATTTATAT GGTTGCATGG ATAGTATATA AATAGTTGGA 
ACTGGTGCTC ATGGCATATT TGATCTGTGC ACCGTGTGGA GATGTCAAAC 
TCGTTCCGCC AATTTATAAT ACCTTAACTT GGGAAAGACA GCTCTTTACT 
TTTGTTATTT GAATTACAAT CTTTATGAGC ATGGTGTTTT CACATTATCA 
TGTGGTATAT AACAGTTTTT AGCTCCGTTA ATACCTTTCT TCTTTTTGAT 
TGTGGTGCAT TGCTTGC 
(2) INFORMATION FOR SEQ ID NO: 3: 



AGCCGAGGGT 

ACACTCCACT 

AAGTGGATTT 

TTCATTATAT 

GTTATGGCTT 

TTTTTCTAGC 

TACCTGGTCA 

GTTGTATCCT 

AAACGCTACT 

GAAATTAGGA 

AGATATCCAT 



AAACTTCTGG 
ATGTGTTACT 
CCTGTGGGCA 
ACTTCTTTCA 
ATAAACTAAC 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



15 



20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
GTAACAGCCA AAAGTTGTGC TTTAGGCAGT TTGACCTTAT TTTGGAAGAT GAATTGTTTA 
TACCTACTTT GACTTTGCTA GAGAATTTTG CATACCGGGG AGTAAGTAGT GGCTCCATTT 
AGGTGGCACC TGGCCATTTT TTTGATCTTT TAAAAAGCTG TTTGATTGGG TCTTCAAAAA 
25 AGTAGACAAG GTTTTTGGAG AAGTGACACA CCCCCGGAGT GTCAGTGGCA AAGCAAAGAT 
TTTCACTAAG GAGATTCAAA ATATAAAAAA AGTATAGACA TAAAGAAGCT GAGGGGATTC 
AACATGTACT ATACAAGCAT CAAATATAGT CTTAAAGCAA TTTTGTAGAA ATAAAGAAAG 
30 TCTTCCTTCT GTTGCTTCAC AATTTCCTTC TATTATCATG AGTTACTCTT TCTGTTCGAA 420 

ATAGCTTCCT TAATATTAAA TTCATGATAC TTTTGTTGAG ATTTAGCAGT TTTTTCTTGT 
35 GTAAACTGCT CTCTTTTTTT GCAG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTAGGTCCTC GTCTACTACA AAATAGTAGT TTCCATCATC ATAACAGATT TTCCTATTAA 



60 
120 
180 
240 
300 
360 



480 

504 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GTTTTGTTAT TCATACCTTG AAGCTGAATT TTGAACACCA TCATCACAGG CATTTCGATT 
CATGTTCTTA CTAGTCTTGT TATGTAAGAC ATTTTGAAAT GCAAAAGTTA AAATAATTGT 
GTCTTTACTA ATTTGGACTT GATCCCATAC TCTTTCCCTT AACAAAATGA GTCAATTCTA 
30 TAAGTGCTTG AGAACTTACT ACTTCAGCAA TTAAACAG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 6: 
GTATTTTAAA TTTATTTCTA CAACTAAATA ATTCTCAGAA CAATTGTTAG ATAGAATCCA 
AATATATACG TCCTGAAAGT ATAAAAGTAC TTATTTTCGC CATGGGCCTT CAGAATATTG 
55 GTAGCCGCTG AATATCATGA TAAGTTATTT ATCCAGTGAC ATTTTTATGT TCACTCCTAT 
TATGTCTGCT GGATACAG 



120 
146 
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AGCATGATGT TGCAGCATCA TTGGCTTTCT TACATGTTCT AATTGCTATT AAGGTTATGC 
TTCTAATTAA CTCATCCACA ATGCAG 
5 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

20 



60 
120 
180 
218 



60 
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180 
198 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
• 5 (A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

15 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
GTTTGTCTGT TTCTATTGCA TTTTAAGGTT CATATAGGTT AGCCACGGAA AATCTCACTC 
TTTGTGAGGT AACCAGGGTT CTGATGGATT ATTCAATTTT CTCGTTTATC ATTTGTTTAT 
25 • TCTTTTCATG CATTGTGTTT CTTTTTCAAT ATCCCTCTTA TTTGGAGGTA ATTTTTCTCA 
TCTATTCACT TTTAGCTTCT AACCACAG 
(2) INFORMATION FOR SEQ ID NO: 8: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

40 

(iv) ANTI -SENSE: NO 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
' GTATGTCTTA CATCTTTAGA TATTTTGTGA TAATTACAAT TAGTTTGGCT TACTTGAACA 
50 AGATTCATTC CTCAAAATGA CCTGAACTGT TGAACATCAA AGGGGTTGAA ACATAGAGGA 
AAACAACATG ATGAATGTTT CCATTGTCTA GGGATTTCTA TTATGTTGCT GAGAACAAAT 
GTCATCTTAA AAAAAACATT GTTTACTTTT TTGTAGTATA GAAGATTACT GTATAGAGTT 
TGCAAGTGTG TCTGTTTTGG AGTAATTGTG AAATGTTTGA TGAACTTGTA CAG 



60 
120 
180 
208 
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293 



10 



15 



20 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GTTCAAGTAT TTTGAATCGC AGCTTGTTAA ATAATCTAGT AATTTTTAGA TTGCTTACTT 6 
GGAAGTCTAC TTGGTTCTGG GGATGATAGC TCATTTCATC TTGTTCTACT TATTTTCCAA 12 
CCGAATTTCT GATTTTTGTT TCGAGATCCA AGTATTAGAT TCATTTACAC TTATTACCGC U 
CTCATTTCTA CCACTAAGGC CTTGATGAGC AGCTTAAGTT GATTCTTTGA AGCTATAGTT 2. 
TCAGGCTACC AATCCACAGC CTGCTATATT TGTTGGATAC TTACCTTTTC TTTACAATGA 3( 
AGTGATACTA ATTGAAATGG TCTAAATCTG ATATCTATAT TTCTCCGTCT TTCCTCCCCC 31 
TCATGATGAA ATGCAG 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

45 

(iv) ANTI -SENSE: NO 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTAAAATCAT CTAAAGTTGA AAGTGTTGGG TTTATGAAGT GCTTTAATTC TATCCAAGGA 
55 * CAAGTAGAAA CCTTTTTACC TTCCATTTCT TGATGATGGA TTTCATATTA TTTAATCCAA 
TAGCTGGTCA AATTCGGTAA TAGCTGTACT GATTAGTTAC TTCACTTTGC AG 



376 



60 
120 
172 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTATATATGT TTTACTTATC CATGAAATTA TTGCTCTGCT TGTTTTTAAT GTACTGAACA 
AGTTTTATGG AGAAGTAACT GAAACAAATC ATTTTCACAT TGTCTAATTT AACTCTTTTT 
TCTGATCCTC GCATGACGAA AACAG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTAAGGATTT GCTTGAATAA CTTTTGATAA TAAGATAACA GATGTAGGGT ACAGTTCTCT 
CACCAAAAAG AACTGTAATT GTCTCATCCA TCTTTAGTTG TATAAGATAT CCGACTGTCT 
GAGTTCGGAA GTGTTTGAGC CTCCTGCCCT CCCCCTGCGT TGTTTAGCTA ATTCAAAAAG 
GAGAAAACTG TTTATTGATG ATCTTTGTCT TCATGCTGAC ATACAATCTG TTCTCATGAC 
AG 

(2) INFORMATION FOR SEQ ID NO: 13: 



60 
120 
145 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTACAGTTCT TGCCGTGTGA CCTCCCTTTT TATTGTGGTT TTGTTCATAG TTATTTGAAT 
GCGATAGAAG TTAACTATTG ATTACCGCCA CAATCGCCAG TTAAGTCCTC TGAACTACTA 
ATTTGAAAGG TAGGAATAGC CGTAATAAGG TCTACTTTTG GCATCTTACT GTTACAAAAC 
AAAAGGATGC CAAAAAAATT CTTCTCTATC CTCTTTTTCC CTAAACCAGT GCATGTAGCT 
TGCACCTGCA TAAACTTAGG TAAATGATCA AAAATGAAGT TGATGGGAAC TTAAAACCGC 
CCTGAAGTAA AGCTAGGAAT AGTCATATAA TGTCCACCTT TGGTGTCTGC GCTAACATCA 
ACAACAACAT ACCTCGTGTA GTCCCACAAA GTGGTTTCAG GGGGAGGGTA GAGTGTATGC 
AAAACTTACT CCTATCTCAG AGGTAGAGAG GATTTTTTCA ATAGACCCTT GGCTCAAGAA 
AAAAAGTCCA AAAAGAAGTA ACAGAAGTGA AAGCAACATG TGTAGCTAAA GCGACCCAAC 
TTGTTTGGGA CTGAAGTAGT TGTTGTTGTT GAAACAGTGC ATGTAGATGA ACACATGTCA 
GAAAATGGAC AACACAGTTA TTTTGTGCAA GTCAAAAAAA TGTACTACTA TTTCTTTGTG 
CAGCTTTATG TATAGAAAAG TTAAATAACT AATGAATTTT GCTAGCAGAA AAATAGCTTG 
GAGAGAAATT TTTTATATTG AACTAAGCTA ACTATATTCA TCTTTCTTTT TGCTTCTTCT 
TCTCCTTGTT TGTGAAG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(x i) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATCATGGCCA ATTACTGGTT . CAAATGCATT ACTTCCTTTC AGATTCTTTC GAGTTCTCAT 
10 GACCGGTCCT ACTACAGACG ATACTAACCC GTGGAACTGT TGCATCTGCT TCTTAGAACT 
CTATGGCTAT TTTCGTTAGC TTGGCGTCGG TTTGAACATA GTTTTTGTTT TCAAACTCTT 
CATTTACAGT CAAAATGTTG TATGGTTTTT GTTTTCCTCA ATGATGTTTA CAGTGTTGTG 
15 TTGTCATCTG TACTTTTGCC TATTACTTGT TTTGAGTTAC ATGTTAAAAA AGTGTTTATT 
TTGCCATATT TTGTTCTCTT ATTATTATTA TCATACATAC ATTATTACAA GGAAAAGACA 
20 AGTACACAGA TCTTAACGTT TATGTTCAAT CAACTTTTGG AGGCATTGAC AGGTACCACA 
AATTTTGAGT TTATGATTAA GTTCAATCTT AGAATATGAA TTTAACATCT ATTATAGATG 
CATAAAAATA GCTAATGATA GAACATTGAC ATTTGGCAGA GCTTAGGGTA TGGTATATCC 
25 AACGTTAATT TAGTAATTTT TGTTACGTAC GTATATGAAA TATTGAATTA ATCACATGAA 
CGGTGGATAT TATATTATGA GTTGGCATCA GCAAAATCAT TGGTGTAGTT GACTGTAGTT 
30 GCAGATTTAA TAATAAAATG GTAATTAACG GTCGATATTA AAATAACTCT CATTTCAAGT 
GGGATTAGAA CTAGTTATTA AAAAAATGTA TACTTTAAGT GATTTGATGG CATATAATTT 
AAAGTTTTTC ATTTCATGCT AAAATTGTTA ATTATTGTAA TGTAGACTGC GACTGGAATT 
35 ATTATAGTGT AAATTTATGC ATTCAGTGTA AAATTAAAGT ATTGAACTTG TCTGTTTTAG 
AAAATACTTT ATACTTTAAT ATAGGATTTT GTCATGCGAA TTTAAATTAA TCGATATTGA 
40 ACACGGAATA CCAAAATTAA AAAGGATACA CATGGCCTTC ATATGAACCG TGAACCTTTG 
ATAACGTGGA AGTTCAAAGA AGGTAAAGTT TAAGAATAAA CTGACAAATT AATTTCTTTT 
ATTTGGCCCA CTACTAAATT TGCTTTACTT TCTAACATGT CAAGTTGTGC CCTCTTAGTT 
45 GAATGATATT CATTTTTCAT CCCATAAGTT CAATTTGATT GTCATACCAC CCATGATGTT 
CTGAAAAATG CTTGGCCATT CACAAAGTTT ATCTTAGTTC CTATGAACTT TATAAGAAGC 
50 TTTAATTTGA CATGTTATTT ATATTAGATG ATATAATCCA TGACCCAATA GACAAGTGTA 
TTAATATTGT AACTTTGTAA TTGAGTGTGT CTACATCTTA TTCAATCATT T AAGGTCATT 
AAAATAAATT ATTTTTTGAC ATTCTAAAAC TTTAAGCAGA ATAAATAGTT TATCAATTAT 
55 TAAAAACAAA AAACGACTTA TTTATAAATC AACAAACAAT TTTAGATTGC TCCAACATAT 



WO 98/37213 PCT/IB98/00270 

39 

TTTTCCAAAT TAAATGCAGA AAATGCATAA TTTTATACTT GATCTTTATA GCTTATTTTT 1S60 

TTTAGCCTAA CCAACGAATA TTTGTAAACT CACAACTTGA TTAAAAGGGA TTTACAACAA 1620 

GATATATATA AGTAGTGACA AATCTTGATT TTAAATATTT TAATTTGGAG GTCAAAATTT 1680 

TACCATAATC ATTTGTATTT ATAATTAAAT TTTAAATATC TTATTTATAC ATATCTAGTA 1740 

AACTTTTAAA TATACGTATA TACAAAATAT AAAATTATTG GCGTTCATAT TAGGTCAATA 1800 

AATCCTTAAC TATATCTGCC TTACCACTAG GAGAAAGTAA AAAACTCTTT ACCAAAAATA 1860 

CATGTATTAT GTATACAAAA AGTCGATTAG ATTACCTAAA TAGAAATTGT ATAACGAGTA 1920 

AGTAAGTAGA AATATAAAAA AACTACAATA CTAAAAAAAA TATGTTTTAC TTCAATTTCG 1980 

AAACTAATGG GGTCTGAGTG AAATATTCAG AAAGGGGAGG ACTAACAAAA GGGTCATAAT 2040 

GTTTTTTTAT AAAAAGCCAC TAAAATGAGG AAATCAAGAA TCAGAACATA CAAGAAGGCA 2100 

GCAGCTGAAG CAAAGTACCA TAATTTAATC AATGGAAATT AATTTCAAAG TTTTATCAAA 2160 
ACCCATTCG 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



2169 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 
CTGTCAAAGA AATTCTCGAG GTTACATGGA TATCTTGAGA ACTTAAGAAA TTTTACAGTA 
TAATTGAACA AGTATATGCA GCATATCCTA ATTTCTGGAC TGACTGGTAG CCATAAACTG 
AATTTGAATT CATAGAAATT ATTGGAGTAG CGTTTGAGCT TCTCAAGGTC CATACAAAGA 
ACACATTCTC AACTATCCGT CTCATAGGAT ACAACATTTT CAATTGCAGT TCAACACCAA 
AAAAATGTAA AAAATAGAAA CATCATGACC AGGTAATCAA AACATACTCG TTCGATACGG 
AATCTATTAT TGGTACATTT AAAAGGCTAG AAAAAACAAA CTTCAGTAGC TATCTCAGCA 
TTATAACTTA TTATGTTTCC AGCAAAAGCC ATAACAAATC TTATATAACT TTCACAAAGA 
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10 



15 



20 



25 



30 



35 



40 



AACAATTTTT ATCATATCCC TGGACATATA ATGAACCCTT 
CTTGACCATG TATTTGTGTT GTAAAAAATC CACTTATTAT 
ACAACAAACA CAATGTAATC CCACAAGTGG AGTGTGGTGA 
GAGATAGAGA GATTGTTTCT AATAGACCCT CGGCTAAAGT 
CGAATATAAA GAAGGCATGA TAAAACACTA AAGGAAGCAT 
GAACAATAAC TACGACAAGA TATATAATAC AATAATCGAA 
AATAACAAAG ATCAAATAAC AAAACAAGAA ACTACCCAAA 
TGAAAGGATA AGCCAGACAA CACTCAAATA CCTAACTAAC 
CCTCCATAAC CTCCTAGAAC ACTCTTTCTA AATATTGTCT 
TCTCAATTTT TGAATTTTAT ACACTCAACC ACCTTGCAAA 
TATGGCTCTA CAAGTGTCAT TTTTCTTCCA TATTTGATAT 
ACTAAGGAGA TGATCCAGAT ATATTGGAAA ATGAAATGCA 
TTAACATGAA ATTAGTAAAA ATTAC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



TATGTGTTCA 
GTATACATAA 
GGACTTTACC 
AAAAGCATTT 
GCTAGAGCAT 
GTACAAGAAA 
TAATTCCACG 
CTTCTACCCC 
TCCCCCACCC 
TTTGTCACAT 
TATAAAAAAT 
AAGGCTAAAA 



GAACTTTGCC 

TTGATTTACA 

CCTACCTTAC 

CAAAGCAACG 

TCTTACCGAG 

CAGAAAATAG 

ACTACTAGTA 

TCATCCGTGT 

CCCCTCCATC 

GATACTTACA 

AAAATAAAAA 

ATAATTGAAA 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1165 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GCAAGCAATG CACCACAGTT AGTTTATATC AAAAAGAAGA AAGGTATTAA 
AACTGTTATA TACCACATGA AAGAAGTTGA TAATGTGAAA ACACCATGCT 
50 GTAATTCAAA TAACAAATGC CCACAGGAGT AAAGAGCTGT CTTTCCCAAG 
ATAAATTGGC GGAACGAAGT AACACATGTT TGACATCTCC ACACGGTGCA 
ATGCCATGAG CACCAGTCCA GAAGTTTTCC AACTATTTAT ATACTATCCA 
TAAATTATCA AACATAC 



55 



CGGAGCTAAA 
CATAAAGATT 
TTAAGGTATT 
CAGATCAAAT 
TGCAACCATA 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTGCAAAAAA AGAGAGCAGT TTACACAAGA AAAAACTGCT AAATCTCAAC AAAAGTATCA 
TGAATTTAAT ATTAAGGAAG CTATTTCGAA CAGAAAGAGT AACTCATGAT AATAGAAGGA 
AATTGTGAAG CAACAGAAGG AAGACTTTCT TTATTTCTAC AAAATTGCTT TAAGACTATA 
TTTGATGCTT GTATAGTACA TGTTGAATCC CCTCAGCTTC TTTATGTCTA TACTTTTTTT 
ATATTTTGAA TCTCCTTAGT GAAAATCTTT GCTTTGCCAC TGACACTCCG GGGGTGTGTC 
ACTTCTCCAA AAACCTTGTC TACTTTTTTG AAGACCCAAT CAAACAGCTT TTTAAAAGAT 
CAAAAAAATG GCCAGGTGCC ACCTAAATGG AGCCACTACT TACTCCCCGG TATGCAAAAT 
TCTCTAGCAA AGTCAAAGTA GGTATAAACA ATTCATCTTC CAAAATAAGG TCAAACTGCC 



360 
420 
480 



35 504 
TAAAGCACAA CTTTTGGCTG TTAC 

(2) INFORMATION FOR SEQ ID NO: 18: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
50 (iv) ANTI- SENSE: YES 



55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

CTGCATTGTG GATGAGTTAA TTAGAAGCAT AACCTTAATA GCAATTAGAA CATGTAAGAA 



60 
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AG CCAATGAT GCTGCAACAT CATGCTTTAA TAGGAAAATC TGTTATGATG ATGGAAACTA 120 
CTATTTTGTA GTAGACGAGG ACCTAC 146 

5 

(2) INFORMATION FOR SEQ ID NO : 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



20 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 CTGTTTAATT GCTGAAGTAG TAAGTTCTCA AGCACTTATA GAATTGACTC ATTTTGTTAA 60 

GGGAAAGAGT ATGGGATCAA GTCCAAATTA GTAAAGACAC AATTATTTTA ACTTTTGCAT 120 

TTCAAAATGT CTTACATAAC AAGACTAGTA AGAACATGAA TCGAAATGCC TGTGATGATG 180 

30 218 
GTGTTCAAAA TTCAGCTTCA AGGTATGAAT AACAAAAC 

(2) INFORMATION FOR SEQ ID NO: 20: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
45 (iv) ANTI- SENSE: YES 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CTGTATCCAG CAGACATAAT AGGAGTGAAC ATAAAAATGT CACTGGATAA ATAACTTATC 
ATGATATTCA GCGGCTACCA ATATTCTGAA GGCCCATGGC GAAAATAAGT ACTTTTATAC 
TTTCAGGACG TATATATTTG GATTCTATCT AACAATTGTT CTGAGAATTA TTTAGTTGTA 



55 



60 
120 
180 
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GAAATAAATT TAAAATAC . 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CTGTGGTTAG AAGCTAAAAG TGAATAGATG AGAAAAATTA CCTCCAAATA AGAGGGATAT 
TGAAAAAGAA ACACAATGCA TGAAAAGAAT AAACAAATGA TAAACGAGAA AATTGAATAA 
TCCATCAGAA CCCTGGTTAC CTCACAAAGA GTGAGATTTT CCGTGGCTAA CCTATATGAA 
CCTTAAAATG CAATAGAAAC AGACAAAC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CTGTACAAGT TCATCAAACA TTTCACAATT ACTCCAAAAC AGACACACTT GCAAACTCTA 
TACAGTAATC TTCTATACTA CAAAAAAGTA AACAATGTTT TTTTTAAGAT GACATTTGTT 
CTCAGCAACA TAATAGAAAT CCCTAGACAA TGGAAACATT CATCATGTTG TTTTCCTCTA 
TGTTTCAACC CCTTTGATGT TCAACAGTTC AGGTCATTTT GAGGAATGAA TCTTGTTCAA 
GTAAGCCAAA CTAATTGTAA TTATCACAAA ATATCTAAAG ATGTAAGACA TAC 



60 
120 
180 
208 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 3 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: YES 

15 



20 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTGCATTTCA TCATGAGGGG GAGGAAAGAC GGAGAAATAT AGATATCAGA TTTAGACCAT 
TTCAATTAGT ATCACTTCAT TGTAAAGAAA AGGTAAGTAT CCAACAAATA TAGCAGGCTG 
25 TGGATTGGTA GCCTGAAACT ATAGCTTCAA AGAATCAACT TAAGCTGCTC ATCAAGGCCT 
TAGTGGTAGA AATGAGGCGG TAATAAGTGT AAATGAATCT AATACTTGGA TCTCGAAACA 
AAAATCAGAA. ATTCGGTTGG AAAATAAGTA GAACAAGATG AAATGAGCTA TCATCCCCAG 
AACCAAGTAG ACTTCCAAGT AAGCAATCTA AAAATTACTA GATTATTTAA CAAGCTGCGA 
TTCAAAATAC TTGAAC 
35 (2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 172 base pairs 
(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

45 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 



50 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:. 
CTGCAAAGTG AAGTAACTAA TCAGTACAGC TATTACCGAA TTTGACCAGC TATTGGATTA 
AATAATATGA AATCCATCAT CAAGAAATGG AAGGTAAAAA GGTTTCTACT TGTCCTTGGA 



60 
120 
180 
240 
300 
360 
376 
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TAGAATTAAA GCACTTCATA AACCCAACAC TTTCAACTTT AGATGATTTT AC 172 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

15 (iv) ANTI- SENSE: YES 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CTGTTTTCGT CATGCGAGGA TCAGAAAAAA GAGTTAAATT AGACAATGTG AAAATGATTT 60 

GTTTCAGTTA CTTCTCCATA AAACTTGTTC AGTACATTAA AAACAAGCAG AGCAATAATT 120 

25 145 
TCATGGATAA GTAAAACATA TATAC 

(2) INFORMATION FOR SEQ ID NO: 26: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
40 (iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTGTCATGAG AACAGATTGT ATGTCAGCAT GAAGACAAAG ATCATCAATA AACAGTTTTC 
TCCTTTTTGA ATTAGCTAAA CAACGCAGGG GGAGGGCAGG AGGCTCAAAC ACTTCCGAAC 



60 
120 



30 TCAGACAGTC GGATATCTTA TACAACTAAA GATGGATGAG ACAATTACAG TTCTTTTTGG 180 
TGAGAGAACT GTACCCTACA TCTGTTATCT TATTATCAAA AGTTATTCAA GCAAATCCTT 

55 ac 

(2) information for seq id no: 27: 



240 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CTTCACAAAC AAGGAGAAGA AGAAGCAAAA AGAAAGATGA ATATAGTTAG CTTAGTTCAA 
TATAAAAAAT TTCTCTCCAA GCTATTTTTC TGCTAGCAAA ATTCATTAGT TATTTAACTT 
TTCTATACAT AAAGCTGCAC AAAGAAATAG TAGTACATTT TTTTGACTTG CACAAAATAA 
CTGTGTTGTC CATTTTCTGA CATGTGTTCA TCTACATGCA CTGTTTCAAC AACAACAACT 
ACTTCAGTCC CAAACAAGTT GGGTCGCTTT AGCTACACAT GTTGCTTTCA CTTCTGTTAC 
TTCTTTTTGG ACTTTTTTTC TTGAGCCAAG GGTCTATTGA AAAAATCCTC TCTACCTCTG 
AGATAGGAGT AAGTTTTGCA TACACTCTAC CCTCCCCCTG AAACCACTTT GTGGGACTAC 
ACGAGGTATG TTGTTGTTGA TGTTAGCGCA GACACCAAAG GTGGACATTA TATGACTATT 
CCTAGCTTTA CTTCAGGGCG GTTTTAAGTT CCCATCAACT TCATTTTTGA TCATTTACCT 
AAGTTTATGC AGGTGCAAGC TACATGCACT GGTTTAGGGA AAAAGAGGAT AGAGAAGAAT 
TTTTTTGGCA TCCTTTTGTT TTGTAACAGT AAGATGCCAA AAGTAGACCT TATTACGGCT 
ATTCCTACCT TTCAAATTAG TAGTTCAGAG GACTTAACTG GCGATTGTGG CGGTAATCAA 
TAGTTAACTT CTATCGCATT CAAATAACTA TGAACAAAAC CACAATAAAA AGGGAGGTCA 
CACGGCAAGA ACTGTAC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CGAATGGGTT TTGATAAAAC TTTGAAATTA ATTTCCATTG ATTAAATTAT GGTACTTTGC 60 
TTCAGCTGCT GCCTTCTTGT ATGTTCTGAT TCTTGATTTC CTCATTTTAG TGGCTTTTTA 
TAAAAAAACA TTATGACCCT TTTGTTAGTC CTCCCCTTTC TGAATATTTC ACTCAGACCC 
CATTAGTTTC GAAATTGAAG TAAAACATAT TTTTTTTAGT ATTGTAGTTT TTTTATATTT 
CTACTTACTT ACTCGTTATA CAATTTCTAT TTAGGTAATC TAATCGACTT TTTGTATACA 
TAATACATGT ATTTTTGGTA AAGAGTTTTT TACTTTCTCC TAGTGGTAAG GCAGATATAG 
TTAAGGATTT ATTGACCTAA TATGAACGCC AATAATTTTA TATTTTGTAT ATACGTATAT 
TTAAAAGTTT ACTAGATATG TATAAATAAG ATATTTAAAA TTTAATTATA AATACAAATG 
ATTATGGTAA AATTTTGACC TCCAAATTAA AATATTTAAA ATCAAGATTT GTCACTACTT 
ATATATATCT TGTTGTAAAT CCCTTTTAAT CAAGTTGTGA GTTTACAAAT ATTCGTTGGT 
TAGGCTAAAA AAAATAAGCT ATAAAGATCA AGTATAAAAT TATGCATTTT CTGCATTTAA 
TTTGGAAAAA TATGTTGGAG CAATCTAAAA TTGTTTGTTG ATTTATAAAT AAGTCGTTTT 
TTGTTTTTAA TAATTGATAA ACTATTTATT CTGCTTAAAG TTTTAGAATG TCAAAAAATA 
ATTTATTTTA ATGACCTTAA ATGATTGAAT AAGATGTAGA CACACTCAAT TACAAAGTTA 
CAATATTAAT ACACTTGTCT ATTGGGTCAT GGATTATATC ATCTAATATA AATAACATGT 
CAAATTAAAG CTTCTTATAA AGTTCATAGG AACTAAGATA AACTTTGTGA ATGGCCAAGC 
ATTTTTCAGA ACATCATGGG TGGTATGACA ATCAAATTGA ACTTATGGGA TGAAAAATGA 
ATATCATTCA ACTAAGAGGG CACAACTTGA CATGTTAGAA AGTAAAGCAA ATTTAGTAGT 
GGGCCAAATA AAAGAAATTA ATTTGTCAGT TTATTCTTAA ACTTTACCTT CTTTGAACTT 
CCACGTTATC AAAGGTTCAC GGTTCATATG AAGGCCATGT GTATCCTTTT TAATTTTGGT 
ATTCCGTGTT CAATATCGAT TAATTTAAAT TCGCATGACA AAATCCTATA TTAAAGTATA 
AAGTATTTTC TAAAACAGAC AAGTTCAATA CTTTAATTTT ACACTGAATG CATAAATTTA 
CACTATAATA ATTCCAGTCG CAGTCTACAT TACAATAATT AACAATTTTA GCATGAAATG 
AAAAACTTTA AATTATATGC CATCAAATCA CTTAAAGTAT ACATTTTTTT AATAACTAGT 
TCTAATCCCA CTTGAAATGA GAGTTATTTT AATATCGACC GTTAATTACC ATTTTATTAT 
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TAAATCTGCA ACTACAGTCA ACTACACCAA TGATTTTGCT 
TATCCACCGT TCATGTGATT AATTCAATAT TTCATATACG 
ATTAACGTTG GATATACCAT ACCCTAAGCT CTGCCAAATG 
ATTTTTATGC ATCTATAATA GATGTTAAAT TCATATTCTA 
CTCAAAATTT GTGGTACCTG TCAATGCCTC CAAAAGTTGA 
CTGTGTACTT GTCTTTTCCT TGTAATAATG TATGTATGAT 
ATATGGCAAA ATAAACACTT TTTTAACATG TAACTCAAAA 
AGATGACAAC ACAACACTGT AAACATCATT GAGGAAAACA 
CTGTAAATGA AGAGTTTGAA AACAAAAACT ATGTTCAAAC 
TAGCCATAGA GTTCTAAGAA GCAGATGCAA CAGTTCCACG 
GGACCGGTCA TGAGAACTCG AAAGAATCTG AAAGGAAGTA 
GGCCATGAT 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



GATGCCAACT 

TACGTAACAA 

TCAATGTTCT 

AGATTGAACT 

TTGAACATAA 

AATAATAATA 

CAAGTAATAG 

AAAACCATAC 

CGACGCCAAG 

GGTTAGTATC 

ATGCATTTGA 



CATAATATAA 

AAATTACTAA 

ATCATTAGCT 

TAATCATAAA 

ACGTTAAGAT 

AGAGAACAAA 

GCAAAAGTAC 

AACATTTTGA 

CTAACGAAAA 

GTCTGTAGTA 

ACCAGTAATT 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
45 ATCATGGCCA ATTACTGGTT CAAATGCATT ACTTCCTTTC AGATTCTTTC 
GACCGGTCCT ACTACAGACG ATACTAACCC GTGGAACTGT TGCATCTGCT 
CTATGGCTAT TTTCGTTAGC TTGGCGTCGG TTTGAACATA GTTTTTGTTT 
CATTTACAGT CAAAATGTTG TATGGTTTTT GTTTTCCTCA ATGATGTTTA 
TTGTCATCTG TACTTTTGCC TATTACTTGT TTTGAGTTAC ATGTTAAAAA 
55 TTGCCATATT TTGTTCTCTT ATTATTATTA TCATACATAC ATTATTACAA 
AGTACACAGA TCTTAACGTT TATGTTCAAT CAACTTTTGG AGGCATTGAC 



50 



GAGTTCTCAT 

TCTTAGAACT 

TCAAACTCTT 

CAGTGTTGTG 

AGTGTTTATT 

GGAAAAGACA 

AGGTACCACA 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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AATTTTGAGT TTATGATTAA GTTCAATCTT 
CATAAAAATA GCTAATGATA GAACATTGAC 
AACGTTAATT TAGTAATTTT TGTTACGTAC 
CGGTGGATAT TATATTATGA GTTGGCATCA 
GCAGATTTAA TAATAAAATG GTAATTAACG 
GGGATTAGAA CTAGTTATTA AAAAAATGTA 
AAAGTTTTTC ATTTCATGCT AAAATTGTTA 
ATTATAGTGT AAATTTATGC ATTCAGTGTA 
AAAATACTTT ATACTTTAAT ATAGGATTTT 
ACACGGAATA CCAAAATTAA AAAGGATACA 
ATAACGTGGA AGTTCAAAGA AGGTAAAGTT 
ATTTGGCCCA CTACTAAATT TGCTTTACTT 
GAATGATATT CATTTTTCAT CCCATAAGTT 
CTGAAAAATG CTTGGCCATT CACAAAGTTT 
TTTAATTTGA CATGTTATTT ATATTAGATG 
TTAATATTGT AACTTTGTAA TTGAGTGTGT 
AAAATAAATT ATTTTTTGAC ATTCTAAAAC 
TAAAAACAAA AAACGACTTA TTTATAAATC 
TTTTCCAAAT TAAATGCAGA AAATGCATAA 
TTTAGCCTAA CCAACGAATA TTTGTAAACT 
GATATATATA AGTAGTGACA AATCTTGATT 
TACCATAATC ATTTGTATTT ATAATTAAAT 
AACTTTTAAA TATACGTATA TACAAAATAT 
AATCCTTAAC TATATCTGCC TTACCACTAG 
CATGTATTAT GTATACAAAA AGTCGATTAG 
AGTAAGTAGA AATATAAAAA AACTACAATA 
AAACTAATGG GGTCTGAGTG AAATATTCAG 
GTTTTTTTAT AAAAAGCCAC TAAAATGAGG 



AGAATATGAA 
ATTTGGCAGA 
GTATATGAAA 
GCAAAATCAT 
GTCGATATTA 
TACTTTAAGT 
ATTATTGTAA 
AAATTAAAGT 
GTCATGCGAA 
CATGGCCTTC 
TAAGAATAAA 
TCTAACATGT 
CAATTTGATT 
ATCTTAGTTC 
ATATAATCCA 
CTACATCTTA 
TTTAAGCAGA 
AACAAACAAT 
TTTTATACTT 
C ACAACTTG A 
TTAAATATTT 
TTTAAATATC 
AAAATTATTG 
GAGAAAGTAA 
ATTACCTAAA 
CTAAAAAAAA 
AAAGGGGAGG 
AAATCAAGAA 



TTTAACATCT 
GCTTAGGGTA 
TATTGAATTA 
TGGTGTAGTT 
AAATAACTCT 
GATTTGATGG 
TGTAGACTGC 
ATTGAACTTG 
TTTAAATTAA 
ATATGAACCG 
CTGACAAATT 
CAAGTTGTGC 
GTCATACCAC 
CTATGAACTT 
TGACCCAATA 
TTCAATCATT 
ATAAATAGTT 
TTTAGATTGC 
GATCTTTATA 
TTAAAAGGGA 
TAATTTGGAG 
TTATTTATAC 
GCGTTCATAT 
AAAACTCTTT 
TAGAAATTGT 
TATGTTTTAC 
ACTAACAAAA 
TCAGAACATA 



ATTATAGATG 
TGGTATATCC 
ATCACATGAA 
GACTGTAGTT 
CATTTCAAGT 
CATATAATTT 
GACTGGAATT 
TCTGTTTTAG 
TCGATATTGA 
TGAACCTTTG 
AATTTCTTTT 
CCTCTTAGTT 
CCATGATGTT 
TATAAGAAGC 
GACAAGTGTA 
TAAGGTCATT 
TATCAATTAT 
TCCAACATAT 
GCTTATTTTT , 
TTTACAACAA 
GTCAAAATTT 
ATATCTAGTA 
TAGGTCAATA 
ACCAAAAATA 
ATAACGAGTA 
TTCAATTTCG 
GGGTCATAAT 
CAAGAAGGCA 
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GCAGCTGAAG CAAAGTACCA TAATTTAATC AATGGAAATT 
ACCCATTCGA GGATCTTTTC CATCTTTCTC ACCTAAAGTT 
TAATTTCATG TTAATTTCAA TTATTTTTAG CCTTTGCATT 
ATCATCTCCT TAGTTTTTTA TTTTATTTTT TATAATATCA 
CTTGTAGAGC CATATGTAAG TATCATGTGA CAAATTTGCA 
TTCAAAAATT GAGAGATGGA GGGGGGGTGG GGGAAGACAA 
GGAGGTTATG GAGGACACGG ATGAGGGGTA GAAGGTTAGT 
GGCTTATCCT TTCATACTAG TAGTCGTGGA ATTATTTGGG 
TGATCTTTGT TATTCTATTT TCTGTTTCTT GTACTTCGAT 
CGTAGTTATT GTTCCTCGGT AAGAATGCTC TAGCATGCTT 
CTTCTTTATA TTCGCGTTGC TTTGAAATGC TTTTACTTTA 
AATCTCTCTA TCTCGTAAGG TAGGGGTAAA GTCCTCACCA 
ATTGTGTTTG TTGTTGTAAA TCAATTATGT ATACATAATA 
AATACATGGT CAAGGGCAAA GTTCTGAACA CATAAAGGGT 
TGATAAAAAT TGTTTCTTTG TGAAAGTTAT ATAAGATTTG 
ATAATAAGTT ATAATGCTGA GATAGCTACT GAAGTTTGTT 
ACCAATAATA GATTCCGTAT CGAACGAGTA TGTTTTGATT 
TTTTTTACAT TTTTTTGGTG TTGAACTGCA ATTGAAAATG 
AGTTGAGAAT GTGTTCTTTG TATGGACCTT GAGAAGCTCA 
TATGAATTCA AATTCAGTTT ATGGCTACCA GTCAGTCCAG 
TACTTGTTCA ATTATACTGT AAAATTTCTT AAGTTCTCAA 
AATTTCTTTG ACAGGCTTCT AGAAATAAGA TATGTTTTCC 
TGAAGTTTGG ATCTCAGGAA CGGTCTTGGG ATATTTCTTC 
GAAAAGATGA AAGGGTATGT TTGATAATTT ATATGGTTGC 
TGGAAAACTT CTGGACTGGT GCTCATGGCA TATTTGATCT 
AAACATGTGT TACTTCGTTC CGCCAATTTA TAATACCTTA 
TACTCCTGTG GGCATTTGTT ATTTGAATTA CAATCTTTAT 
ATCAACTTCT TTCATGTGGT ATATAACAGT TTTTAGCTCC 
TGATATAAAC TAACTGTGGT GCATTGCTTG CATGAAGCAC 



AATTTCAAAG 
TCTTCAGGGG 
TCATTTTCCA 
AATATGGAAG 
AGGTGGTTGA 
TATTTAGAAA 
TAGGTATTTG 
TAGTTTCTTG 
TATTGTATTA 
CCTTTAGTGT 
GCCGAGGGTC 
CACTCCACTT 
AGTGGATTTT 
TCATTATATG 
TTATGGCTTT 
TTTTCTAGCC 
ACCTGGTCAT 
TTGTATCCTA 
AACGCTACTC 
AAATTAGGAT 
GATATCCATG 
TTCTCAACAT 
CACCCCAAAA 
ATGGATAGTA 
GTGCACCGTG 
ACTTGGGAAA 
GAGCATGGTG 
GTTAATACCT 
AGTTCAGCTA 



TTTTATCAAA 
TAATTTTTAC 
ATATATCTGG 
AAAAATGACA 
GTGTATAAAA 
GAGTGTTCTA 
AGTGTTGTCT 
TTTTGTTATT 
TATATCTTGT 
TTTATCATGC 
TATTAGAAAC 
GTGGGATTAC 
TTACAACACA 
TCCAGGGATA 
TGCTGGAAAC 
TTTTAAATGT 
GATGTTTCTA 
TGAGACGGAT 
CAATAATTTC 
ATGCTGCATA 
TAACCTCGAG 
AGTACTGGAC 
TCAAGAGTTA 
TATAAATAGT 
TGGAGATGTC 
GACAGCTCTT 
TTTTCACATT. 
TTCTTCTTTT 
TTTCCGCTGT 



2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 



WO 98/37213 



PCT/IB98/00270 



I 



3 



5 



40 



45 



50 



55 



51 

TTTGACCGAT GACGACAATT CGACAATGGC ACCCCTAGAG GAAGATGTCA AGACTGAAAA 
TATTGGCCTC CTAAATTTGG ATCCAACTTT GGAACCTTAT CTAGATCACT TCAGACACAG 
AATGAAGAGA TATGTGGATC AGAAAATGCT CATTGAAAAA TATGAGGGAC CCCTTGAGGA 
ATTTGCTCAA GGTAACAGCC AAAAGTTGTG CTTTAGGCAG TTTGACCTTA TTTTGGAAGA 
TGAATTGTTT ATACCTACTT TGACTTTGCT AGAGAATTTT GCATACCGGG GAGTAAGTAG 
TGGCTCCATT TAGGTGGCAC CTGGCCATTT TTTTGATCTT TTAAAAAGCT GTTTGATTGG 
GTCTTCAAAA AAGTAGACAA GGTTTTTGGA GAAGTGACAC ACCCCCGGAG TGTCAGTGGC 
AAAGCAAAGA TTTTCACTAA GGAGATTCAA AATATAAAAA AAGTATAGAC ATAAAGAAGC 
TGAGGGGATT CAACATGTAC TATACAAGCA TCAAATATAG TCTTAAAGCA ATTTTGTAGA 
AATAAAGAAA GTCTTCCTTC TGTTGCTTCA CAATTTCCTT CTATTATCAT GAGTTACTCT 
TTCTGTTCGA AATAGCTTCC TTAATATTAA ATTCATGATA CTTTTGTTGA GATTTAGCAG 
TTTTTTCTTG TGTAAACTGC TCTCTTTTTT TGCAGGTTAT TTAAAATTTG GATTCAACAG 
GGAAGATGGT TGCATAGTCT ATCGTGAATG GGCTCCTGCT GCTCAGTAGG TCCTCGTCTA 
CTACAAAATA GTAGTTTCCA TCATCATAAC AGATTTTCCT ATTAAAGCAT GATGTTGCAG 
CATCATTGGC TTTCTTACAT GTTCTAATTG CTATTAAGGT TATGCTTCTA ATTAACTCAT 
CCACAATGCA GGGAAGCAGA AGTTATTGGC GATTTCAATG GATGGAACGG TTCTAACCAC 
ATGATGGAGA AGGACCAGTT TGGTGTTTGG AGTATTAGAA TTCCTGATGT TGACAGTAAG 
CCAGTCATTC CACACAACTC CAGAGTTAAG TTTCGTTTCA AACATGGTAA TGGAGTGTGG 
GTAGATCGTA TCCCTGCTTG GATAAAGTAT GCCACTGCAG ACGCCACAAA GTTTGCAGCA 
CCATATGATG GTGTCTACTG GGACCCACCA CCTTCAGAAA GGTTTTGTTA TTCATACCTT 
GAAGCTGAAT TTTGAACACC ATCATCACAG GCATTTCGAT TCATGTTCTT ACTAGTCTTG 
TTATGTAAGA CATTTTGAAA TGCAAAAGTT AAAATAATTG TGTCTTTACT AATTTGGACT 
TGATCCCATA CTCTTTCCCT TAACAAAATG AGTCAATTCT ATAAGTGCTT GAGAACTTAC 
TACTTCAGCA ATTAAACAGG TACCACTTCA AATACCCTCG CCCTCCCAAA CCCCGAGCCC 
CACGAATCTA TGAAGCACAT GTCGGCATGA GCAGCTCTGA GCCACGTGTA AATTCGTATC 
GTGAGTTTGC AGATGATGTT TTACCTCGGA TTAAGGCAAA TAACTATAAT ACTGTCCAGT 
TGATGGCCAT AATGGAACAT TCTTACTATG GATCATTTGG ATATCATGTT ACAAACTTTT 
TTGCTGTGAG CAGTAGATAT GGAAACCCGG AGGACCTAAA GTATCTGATA GATAAAGCAC 
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ATAGCTTGGG 
CTGATGGCCT 
GAGAGCGAGG 
TTCTTCGTTT 
TTCGATTTGA 
CAGGAAACTA 
TGTTGGCCAA 
TTTCTGGTAT 
GCCTGGCAAT 
ATTGGTCCAT 
TAGCATATGC 
TCAGAACAAT 
TTTCGCCATG 
AGTGACATTT 
AGACCATTGC 
ATGCTTCTCC 
ATTTTAAGGT 
TCTGATGGAT 
TCTTTTTCAA 
TAACCACAGA 
ATGGGTAACG 
CTTACTTGAA 
AAACATAGAG 
CTGAGAACAA 
CTGTATAGAG 
TACAGTTTGG 
ACAAATGTAG 
TATTTTGAAT 
TACTTGGTTC 



TTTACAGGTT 
CAATGGCTTT 
GTACCATAAG 
CCTTCTTTCC 
TGGAATAACT 
TAATGAGTAT 
TAATCTGATT 
GCCGGGCCTT 
GGCAATCCCA 
GAAGGAAGTA 
GGAGACCCAT 
TGTTAGATAG 
GGCCTTCAGA 
TTATGTTCAC 
ATTTCTCCTA 
TGTTGTTGAT 
TCATATAGGT 
TATTCAATTT 
TATCCCTCTT 
TGATCCATTT 
AGGTATGTCT 
CAAGATTCAT 
GAAAACAACA 
ATGTCATCTT 
TTTGCAAGTG 
CCATCCTGAG 
ACGCCAGTGG 
CGCAGCTTGT 
TGGGGATGAT 



CTGGTGGATG 
GATATTGGCC 
TTGTGGGATA 
AACTTGAGGT 
TCTATGCTGT 
TTCAGCGAGG 
CACAAGATTT 
GGCCGGCCTG 
GATAAGTGGA 
ACATCGAGTT 
GATCAGGTAT 
AATCCAAATA 
ATATTGGTAG 
TCCTATTATG 
ATGGACAAAG 
CGAGGAATTG 
TAGCCACGGA 
TCTCGTTTAT 
ATTTGGAGGT 
TTTCACAATG 
TACATCTTTA 
TCCTCAAAAT 
TGATGAATGT 
AAAAAAAACA 
TGTCTGTTTT 
TGGATTGACT 
AACCTCGCGG 
TAAATAATCT 
AGCTCATTTC 



52 

TAGTTCACAG 
AAGGTTCTCA 
GCAGGCTGTT 
GGTGGCTAGA 
ATGTTCATCA 
CTACAGATGT 
TCCCAGATGC 
TTTCTGAGGG 
TAGATTATTT 
TGACAAATAG 
TTTAAATTTA 
TATACGTCCT 
CCGCTGAATA 
TCTGCTGGAT 
AGATGTATTC 
CGCTTCACAA 
AAATCTCACT 
CATTTGTTTA 
AATTTTTCTC 
GCCTTGGGAG 
GATATTTTGT 
GACCTGAACT 
TTCCATTGTC 
TTGTTTACTT 
GGAGTAATTG 
TCCCTAGAGA 
ATAGCGAACA 
AGTAATTTTT 
ATCTTGTTCT 



TCATGCAAGC 
AGAATCCTAC 
CAACTATGCC 
AGAGTATAAC 
TGGAATCAAT 
TGATGCTGTG 
AACTGTTATT 
AGGAATTGGT 
AAAGAATAAG 
GAGATATACA 
TTTCTACAAC 
GAAAGTATAA 
TCATGATAAG 
ACAGTCTATT 
TGGCATGTCT 
GGTTTGTCTG 
CTTTGTGAGG 
TTCTTTTCAT 
ATCTATTCAC 
GAGAGGGGTA 
GATAATTACA 
GTTGAACATC 
TAGGGATTTC 
TTTTGTAGTA 
TGAAATGTTT 
GGGCAATAAT 
CTTGAGATAC 
AGATTGCTTA 
ACTTATTTTC 



AATAATGTCA 
TTTCATGCTG 
AATTGGGAGG 
TTTGACGGAT 
ATGGGATTTA 
GTCTATTTAA 
GCCGAAGATG 
TTTGTTTACC 
AATGATGAAG 
GAGAAGTGTA 
TAAATAATTC 
AAGTACTTAT 
TTATTTATCC 
GTTGGTGACA 
TGCTTGACAG 
TTTCTATTGC 
TAACCAGGGT 
GCATTGTGTT 
TTTTAGCTTC 
CCTCAATTTC 
ATTAGTTTGG 
AAAGGGGTTG 
TATTATGTTG 
TAGAAGATTA 
GATGAACTTG 
TGGAGTTATG 
AAGGTTCAAG 
CTTGGAAGTC 
CAACCGAATT 
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TCTGATTTTT GTTTCGAGAT CCAAGTATTA GATTCATTTA CACTTATTAC CGCCTCATTT 
CTACCACTAA GGCCTTGATG AGCAGCTTAA GTTGATTCTT TGAAGCTATA GTTTCAGGCT 
ACCAATCCAC AGCCTGCTAT ATTTGTTGGA TACTTACCTT TTCTTTACAA TGAAGTGATA 
CTAATTGAAA TGGTCTAAAT CTGATATCTA TATTTCTCCG TCTTTCCTCC CCCTCATGAT 
GAAATGCAGT TTATGAATGC ATTTGATAGA GCTATGAATT CGCTCGATGA AAAGTTCTCA 
TTCCTCGCAT CAGGAAAACA GATAGTAAGC AGCATGGATG ATGATAATAA GGTAAAATCA 
TCTAAAGTTG AAAGTGTTGG GTTTATGAAG TGCTTTAATT CTATCCAAGG ACAAGTAGAA 
ACCTTTTTAC CTTCCATTTC TTGATGATGG ATTTCATATT ATTTAATCCA ATAGCTGGTC 
AAATTCGGTA ATAGCTGTAC TGATTAGTTA CTTCACTTTG CAGGTTGTTG TGTTTGAACG 
TGGTGACCTG GTATTTGTAT TCAACTTCCA CCCAAAGAAC ACATACGAAG GGTATATATG 
TTTTACTTAT CCATGAAATT ATTGCTCTGC TTGTTTTTAA TGTACTGAAC AAGTTTTATG 
GAGAAGTAAC TGAAACAAAT CATTTTCACA TTGTCTAATT TAACTCTTTT TTCTGATCCT 
CGCATGACGA AAACAGGTAT AAAGTTGGAT GTGACTTGCC AGGGAAGTAC AGAGTTGCAC 
TGGACAGTGA TGCTTGGGAA TTTGGTGGCC ATGGAAGAGT AAGGATTTGC TTGAATAACT 
TTTGATAATA AGATAACAGA TGTAGGGTAC AGTTCTCTCA CCAAAAAGAA CTGTAATTGT 
CTCATCCATC TTTAGTTGTA TAAGATATCC GACTGTCTGA GTTCGGAAGT GTTTGAGCCT 
CCTGCCCTCC CCCTGCGTTG TTTAGCTAAT TCAAAAAGGA GAAAACTGTT TATTGATGAT 
CTTTGTCTTC ATGCTGACAT ACAATCTGTT CTCATGACAG ACTGGTCATG ATGTTGACCA 
TTTCACATCA CCAGAAGGAA TACCTGGAGT TCCAGAAACA AATTTCAATG GTCGTCCAAA 
TTCCTTCAAA GTGCTGTCTC CTGCGCGAAC ATGTGTGGTA CAGTTCTTGC CGTGTGACCT 
CCCTTTTTAT TGTGGTTTTG TTCATAGTTA TTTGAATGCG ATAGAAGTTA ACTATTGATT 
ACCGCCACAA TCGCCAGTTA AGTCCTCTGA ACTACTAATT TGAAAGGTAG GAATAGCCGT 
AATAAGGTCT ACTTTTGGCA TCTTACTGTT ACAAAACAAA AGGATGCCAA AAAAATTCTT 
CTCTATOCTC TTTTTCCCTA AACCAGTGCA TGTAGCTTGC ACCTGCATAA ACTTAGGTAA 
ATGATCAAAA ATGAAGTTGA TGGGAACTTA AAACCGCCCT GAAGTAAAGC TAGGAATAGT 
CATATAATGT CCACCTTTGG TGTCTGCGCT AACATCAACA ACAACATACC TCGTGTAGTC 
CCACAAAGTG GTTTCAGGGG GAGGGTAGAG TGTATGCAAA ACTTACTCCT ATCTCAGAGG 
TAGAGAGGAT TTTTTCAATA GACCCTTGGC TCAAGAAAAA AAGTCCAAAA AGAAGTAACA 
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GAAGTGAAAG CAACATGTGT AGCTAAAGCG ACCCAACTTG TTTGGGACTG AAGTAGTTGT 
TGTTGTTGAA ACAGTGCATG TAGATGAACA CATGTCAGAA AATGGACAAC ACAGTTATTT 
TGTGCAAGTC AAAAAAATGT ACTACTATTT CTTTGTGCAG CTTTATGTAT AGAAAAGTTA 
AATAACTAAT GAATTTTGCT AGCAGAAAAA TAGCTTGGAG AGAAATTTTT TATATTGAAC 
TAAGCTAACT ATATTCATCT TTCTTTTTGC TTCTTCTTCT CCTTGTTTGT GAAGGCTTAT ' 
TACAGAGTTG ATGAACGCAT GTCAGAAACT GAAGATTACC AGACAGACAT. TTGTAGTGAG 
CTACTACCAA CAGCCAATAT CGAGGAGAGT GACGAGAAAC TTAAAGATTC GTTATCTACA 
AATATCAGTA ACATTGACGA ACGCATGTCA GAAACTGAAG TTTACCAGAC AGACATTTCT 
AGTGAGCTAC TACCAACAGC CAATATTGAG GAGAGTGACG AGAAACTTAA AGATTCGTTA 
TCTACAAATA TCAGTAACAT TGATCAGACT GTTGTAGTTT CTGTTGAGGA GAGAGACAAG 
GAACTTAAAG ATTCACCGTC TGTAAGCATC ATTAGTGATG TTGTTCCAGC TGAATGGGAT 
GATTCAGATG CAAACGTCTG GGGTGAGGAC TAGTCAGATG ATTGATCGAC CCTTCTACCG 
ATTGGTGATC GCTATCCTTG CTCTCTGAGA AATAGGTGAG GCGAAACAAA AAATAATTTG 
CATGATAAAA AGTCTGATTT TATGATCGCT ATCCTCGCTC TCTGAGAAAG AAGCGAAACA 
AAGGCGACTC CTGGACTCGA ATCTATAAGA TAACAAAGGC GACTCCTGGG ACTCGAATCT 
ATAAGATAAC AAAGGCAATT CCAAGACTTG AATCTATAAA AAATTTAGTT AAGAATGATT 
AACGTCCGAT CCTAATTCGA ATCGAGGCAT CTTACCACTC CATTGATAAT TATATAAGTC 
AATAAGTCAT ATAAAGTATT AAAAACTAAA TTGACTTGAT CGGTCTATCA AAAATAGATA 10020 
AATTGTGTTC ATATGTAACA TTTTTGTTGT CACAATTAGC TTAATTACAT CTTTCATGTG 10080 
CAATAACAAA GAAATGATAG GAATTTAGAG ATTCCAATTT TTTTGTTGCC ACAATTAACT 10140 
TAATTACATC TTTCATTTGC AATAACAAAG AAATGATAGG AATTTAGAGA TCCAGTGTCA 10200 
ATACACAACC TAGGCCAACA TCGAAAGCAT AACTGTAAAC TCATGCATGA AGAAATCAGT 10260 
CGTAAAAATG AATAAATGCG ACATAAAAAC AAATTGCATG TATCATTAAT GTGACTTAAC 10320 
TACAAGTAAA AATAAATTTA ACAAATGTAA CTTAACTACA AGTAAAAATA AATTGCTTCT 10380 
ATCATTAACA AACAAACAGA ATTAAAAAGA AAAAAACATA CTAAATCTTA CCGTCATTCG 10440 
ATAAAAAAAA ATACCAAATT CATAATGCAA GGAAAACGAA ACGCGTCCTG ATCGGGTATC 10500 
AACGATGAAA TGGACCAGTT GGATCGACTG CCTGCACAAC GTTAGGTATG CCAAAAAAAA 10560 
GAACACGATC CTTTGCACCC GTTCGATGAT TATCAGTATG TTCACAAAAA AAACTTAAGT 10620 
TCATCCCAGT GTACAACAGC CCCAACATCT GCCCCAAGTA ACAAAAAACA ACCAATTTAT 10680 
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CTTATTCTTA 


rT ,__- r ,^, OAn TvaTMTrrrT TT P AC ACT AT TCTCTTGTTA TACAAAATTG 
TCTGCCACAA AATAft I Loo I 1 iLhwAL x a 


10740 


ACAAGTAGGA 


-»^^,*^*/-»/-»*/^ rr^nTp^KAT aaarGrrrGCA CGTTCTTTGA GAAAAGTCTT 
AGGAGAGGAG ' TCATCCAAAT MA^wiw^ v,\3i*vi**wi 


10800 


ATTTTTCGTA 


» ^-^^/-.^Tv Tvcprp rn P7 , nPnnarT tttcttCAAG TCAAAATTCC TGATAGTGTA 
AGATCCAATT TCAACAAACT i 1 1 v- A -H-i*/*o xwwxr** ■».ww 


10860 


TCTCCTCTCG 


,^^,,™™,T' rr^TTr^Brr aTrrrCGCTT ATCATGAAAA GTTGCTTGGA 
ACGACCTCTT GCATTGAACG A1\-1LL»j^ii a ^« j. 


10920 


TAACAAGTAT 


■ ^ * ~~~~~ /-rr-n TATTTiTifiTTZv GTCGGCCCAA GGAAATGGAG 

TGCAAGGGGG GGACAGTAGC TAiiMolia o x v_ovjv.v_ ~ 


10980 


GAGTGATAGT 


*mfn^Tn^^»T>r"r TTArr&TT^r CCGGTCTGGC TTTAAGGAGT 

CTCGAATATT ATTCACCTCT TTAGCAiiAL u\-ooj.n-av?vj^ ** 


11040 


TACGTCTTTT 


_ — , momrnnwP •TifP7vr'7\TkTr!r , T TGGTGTCAAA ATCGCGAGTT 

ACGCTCGCCA ATTTC TTTTT TTAGAATGGX xooion-urt** «j.^-vj%-«4-> 


11100 


GTGGAAGGTT 


_ ——fT,^- » nrry^rry rTvPTfinTORCI AGAGATTCGA 

CAAGTTACTC GATTCGTGAT TTTGAAGTAT o Ab ± oo l omo fturtoni*wn 


11160 


TATTTTCACG 


AGGTGTATTC GAGGTCTAGT AGAACGAAGG bl^l^wiMa 


11220 


AAGAGTTCAT 


___ mmmm/-i/^ ^T-Tirp /-» 7\ 7\ TV TCI ^ HT A TGAAAATTCT 

CATCATCTTC TTCTAGTAGA TTTTCGCTTT CAAAloAoiA i^wiMniiv.i 


11280 


TCCTCTTTTC 


TATTGATTTT CTTCATTGTT TTCTTCATTG TTGTGGTTGT TATTGAAAAG 


11340 


AAAGAAAATT 


TATAACAGAA AAAGATGTCA AAAAAAAGGT AAAATGAAAG AGTATCATAT 


11400 


ACTTAAAGAG 


TTGCGTAGAG ATAAGTCAAA AGAAACAGAA TTATAGTAAT TTCAGCTAAG 


11460 


TTAGAATTC 




11469 


(2) INFORMATION FOR SEQ ID NO: 30: 





(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc * "Synthetic DNA Primer" 



(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGAATTCCAG TCGCAGTCTA CATTAC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CGGGATCCAG AGGCATTAAG ATTTCTGG 
(2) INFORMATION FOR SEQ ID NO : 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CGGGATCCAA AGAAATTCTC GAGGTTACAT GG 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
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CGGGATCCGG GGTAATTTTT ACTAATTTCA TG 32 
(2) INFORMATION FOR SEQ ID NO: 34: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer" 

15 (iii) HYPOTHETICAL: NO 

<iv) ANTI- SENSE: YES 



20 



25 



35 



40 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

32 

CGGGATCCCG TATGTCTCAC TGTGTTTGTG GC 
(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc "Synthetic DNA Primer 

(iii) HYPOTHETICAL: NO 
(iv) ANTI - SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

45 32 
CGGGATCCCC CTACATACAT ATATCAGATT AG 

(2) INFORMATION FOR SEQ ID NO: 36: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer 1 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36; 
CCATCGATAC TTTAAGTGAT TTGATGGC 
(2) INFORMATION FOR SEQ ID NO : 37: 



28 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNES S : s ingle 
(D) TOPOLOGY: linear 



50 



55 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic DNA Primer' 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
CGGGATCCTG TTCTGATTCT TGATTTCC 
35 (2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2122 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
45 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



28 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GTATGTCTCA CTGTGTTTGT GGCTGTGTGT GTTTTTTTCT CTGTCTTTTT GTGTTTTGTG 
TAATTGGGGC TCTTTAAAGT TGGTATTGTG TATACCCTTT TGAGTATAGT CTTTGAGGAA 



60 
120 



WO 98/37213 



PCT/IB98/00270 



59 



10 



15 



20 



25 



30 



35 



40 
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GCAAAATGAT GAATCTTGAT TGACATTAGT AAGGGTTGTA 
GTGTAATTGA GTTTGGCTTG TGTGTCTGTG TGTCGAGGTT 
TGGGGATTCT TAAAAGTTGG TATTGTGTAT ACCCTTTTGA 
AAAATGATGA ATCTTGATTG GCATTAGTAA AGGTTGTAGC 
GTAATTGAGT TTGGCTTGTG TGTCTGTGTG TTTTGGAATC 
ATATGGGTCG AGGTTCTTTC TTTGGTTTGT GTAATTGGGG 
TGTACCTTTT TAAGAATAGT GTCTGAGAAA GCAAAATCGA 
ATTCTTTGAG AAAGCAAAAA ATGGTGAGTT TTCATGGAGA 
AGGTAGCAAC TTTTTCAACT CCTGATATGG GTCAAGGTTC 
TGGGGTTCTT TGAAGTTTTG AGAAAGAAAA ATTATGATTT 
ACATTAATAA AGGTAGTAGC TTTTTAAAGT GTGGTCAGCT 
TAAAGGGGCC CTACATATGG TGCTTTCTGG TGAGATATTT 
ATAAGAATCA TAGTGTTAGG ATCTTTTTTC TTTTTTTTTT 
ACTAGAGGAG TGATCTTGAC GGCGGAAAAT CTTAGAAAGG 
TGGTGTTATA TGTGCAAGGA GACGGGAGAT GATGTAGATC 
CTTTCCATGA GGTTATGATG TGATATGTTT GAATGGTTTG 
GAACTGTGAA AGAATTGATA TTCAGTTGGA AGTGTGGAGT 
CACTTGGTTC CATTAGCTTT AATGTGGGTG GTGTGGAGAG 
TTGAGGGGGT AGAGTTGAGC TTTCCTCAGT TGAGAAGTAG 
TTTTTTTGTA CACCCATAGA ATTCCCAATT GTATAGAAGA 
ATCATCTTTT GTAGTAGATT CTTTACCTTT TGGTATATCC 
TGACTATGTT TATGAATGAA TATACATTAC TTGAAAAAAA 
TGTACCTTTG TAGACAATGT TGTTGCAGCA TCTTGATAAT 
GAAGGAATAG TTTGGTTGAT ATTGATTATT TCTTGGTTTG 
GGCCATTTTA AATCCTTTGA CATTGTTAAA GGTGTTTACA 
AGCACCTCTT GTATGGTGCT TTCTGGAGTG ATCTTTCTTC 
GAAtCAGTGT GTGTACTTTT TTCTCTTGTA TGATCAGATC 
TAGTTGATTT ATCCATATAG TGAAAGTTGG TGTCATAGTT 
AAAAGTTTTT TGATATACTT AAAAAATTGT CACACAGAAG 



ACTTTTTGAA 

ATTTTTTTGG 

GTATAGTCTT 

TTTTTGAAGT 

CTGATGTGTG 

GTTCTTAAAA 

TGAATTTTGA 

AACTTGATTG 

TTTGTTTGGT 

TTCATGGAGA 

GTAATGAGTT 

GTTGCTCCAC 

CATTTTTCAC 

GGAAGGTTGT 

ATCTTCTTCT 

GTACTTCTTG 

TGGAAGAGTG 

AGAGAGAAAT 

CCTTTGATAT 

TTGGGTGGAG 

ATTGTATACA 

AAGAAGTGAA 

TCCCTGAAAA 

TTTAATTCGG 

AGTGTTGGTC 

CTCCAAAAGA 
TTTTTTCAAT 

GCTGTTTGTG 
AAAGAGTTTT 



GTTTGGTTAG 
TTTGTGTTAT 
TGAGGAAGCA 
GTGGTTAGGT 
TCAAGTCCTG 
GTTGGTATTA 
TTGACAGCAT 
ACATTACTAA 
TTGTGTAATT 
AATTTGATTT 
CAGCTTGGTT 
CATACGAGTT 
TTGACTAGCT 
TTGCATCAAC 
TCATTGTGGT 
GCTATGCCAA 
GAAGAATTGA 
AGGAGAGCTT 
CTTTTTTTTT 
TTTGTAGAGA 
GCCAGGCCTT 
GCCAGTCTGT 
TTGTCTCCCT 
TGTTCTTGAA 
TGGGTTTAAA 
GAAGTTGCAA 
TTTTCCGTTT 
GACTTCCTGT 
TTACCATTAC 



180 

240 

300 

360 

420 

480 

540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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TTAAGCTAGA TGGGACTGTT TGATTCTTAG ACCAAATAAT GAACCTTTTT GTTCTCTTAA 1920 
CGTGTACTTG AAATAGTTTG GTAAAATTGT GATAGGAAAA AAGATAATTC TTGATTGCTT 1980 
TTGGAGCATC ACTTCTAATC ATAAAAGTCT TTGCTCTCTT CAACCATGAA TGATAAATTG 2040 
GACACTTATG TGGCCCTAAG TTGCTCTCAG TAGTGGTCTT TAATTGTGGA GATATAACTA 2100 

2122 

ATCTGATATA TGTATGTAGG GA 
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CLAIMS 

1 A method of affecting enzymatic activity in a plant (or a cell, a tissue or an 
organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a 

5 nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for 
an intron of a class A potato starch branching enzyme in an antisense orientation, 
optionally together with a nucleotide sequence which codes, partially or completely, for 
an intron of a class B starch branching enzyme in an antisense or sense orientation; and 
wherein the nucleotide sequence does not contain a sequence that is antisense to an exon 

10 sequence normally associated with the intron. 

2. A method according to claim 1 wherein starch branching enzyme activity 
is affected and/or wherein the levels of amylopectin are affected and/or the composition 
of starch is changed. 

15 

3. A method of affecting enzymatic activity in a starch producing organism 
(or a cell, a tissue or an organ thereof) comprising expressing in the starch producing 
organism (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the 
nucleotide sequence codes, partially or completely, for an intron of a class A starch 

20 branching enzyme in an antisense orientation, optionally together with a nucleotide 
sequence which codes, partially or completely, for an intron of a class B starch branching 
enzyme in an antisense or sense orientation; and wherein starch branching enzyme 
activity is affected and/or the levels of amylopectin are affected and/or the composition of 
starch is changed. 

25 

4. A method according to claim 3 wherein the nucleotide sequence does not 
contain a sequence that is antisense to an exon sequence normally associated with the 
intron. 

30 5 . A method according to any one of the preceding claims wherein the 

enzymatic activity is reduced or eliminated. 
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6. A method according to any one of the preceding claims wherein the 
nucleotide sequence codes for at least substantially all of at least one intron in an antisense 
orientation. 

5 

7. A method according to any one of the preceding claims wherein the 
nucleotide sequence codes for all of at least one intron in an antisense orientation. 

8. A method according to any one of the preceding claims wherein the 
10 nucleotide sequence comprises the complement of SEQ. ID. No. 38, or a fragment 

thereof. 

9. A method according to any one of the preceding claims wherein the 
nucleotide sequence is expressed by a promoter having a sequence shown as SEQ.I.D. 

15 No. 14 or a variant, derivative or homologue thereof. 

10. An antisense sequence comprising the nucleotide sequence as defined in 
claim 8 or a variant, derivative or homologue thereof. 

20 11. A promoter having a sequence shown as SEQ.I.D. No. 14, or a variant, 

derivative or homologue thereof. 

12. A promoter according to claim 11 in combination with a gene of interest 
("GOD. 

25 

13 . A construct capable of comprising or expressing the invention according to 
any one of claims 10 to 12. 

14. A vector comprising or expressing the invention according to any one of 
30 claims 10 to 13. 
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15. A combination of nucleotide sequences comprising a first nucleotide 
sequence coding for a recombinant enzyme; and a second nucleotide sequence which 
corresponds to an intron in antisense orientation; wherein the intron is an intron that is 
associated with a genomic gene encoding an enzyme corresponding to the recombinant 

5 enzyme; and wherein the second nucleotide sequence does not contain a sequence that is 
antisense to an exon sequence normally associated with the intron. 

16. A cell, tissue or organ comprising or expressing the invention according to 
any one of claims 10 to 15. 

10 

17. A transgenic starch producing organism comprising or expressing the 
invention according to any one of claims 10 to 16. 

18. A transgenic starch producing organism according to claim 17 wherein the 
15 organism is a plant. 

19. A starch obtained from the invention according to any one of the preceding 

claims. 

20 20. A nucleotide sequence that is antisense to an intron of class A SBE. 

21. A method for modifying starch production in an organism, comprising 
transforming the organism with a transgene capable of expressing an antisense intron 
sequence relating to class A SBE and a transgene capable of expressing an antisense 
25 intron sequence relating to class B SBE, thereby reducing or eliminating endogenous class 
A and class B production, and a farther sequence encoding a SBE from a heterologous 
source. 
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ATCATGGCCAATTACTGGTTCAAATGCATTACTTCCTTT^ 

GACCGGTCCTACTACAGACGATACTAACCCGTGGAACTGTTGCATCTGCTTCTTAGAACT 

CTATGGCTATTTTCGTTAGCTTGGCGTCGGTTTGAACATAGTTTTTG l'l l'ICAAACTCTT 180 

CATTTACAGTCAAAATGTTGTATGGTTTTTGTTTrcCTC^ 240 

TTGTCATCTGTACTTTTGCCTATTACTTGTTTTGACT 300 

TTGCCATATTTTGTTCTCTTATTATTATTATCATACATACATTATTACAAGGAA 360 

AGTACACAGATCTTAACGTTTATGTTCAATCAACTTTTGGAGGCATTGACAGGTACCA 420 

AATTTTGAGTTTATGATTAAGTTCAATCITAGAATATGA^ 480 

C^TAAAAATAGCTAATGATAGAACATTGACATTTGGCAGA 540 

AACGTTAATTTAGTAATTTTTGTTACGTACGTATATGAAATATTGAATTAATCA 600 

CGGTGGATATTATATTATGAGTTGGCATCAGCAAAATCATTG^^ 660 

GCA.GATTTAATAATAAAATGGTAATTAACGGTCGATATTAAAATAACTCT 720 

GGGATTAGAACTAGTTATTAAAAAAATCTATACT 780 

AMGTTTTTCATTTCATGCTAAAATTGTTAATTATTCT 840 
ATTATAGrCTT^AATTTATGCATTCAGTGTAAAATTAAACT 900 
AAA&TACTTTATACTTTAATATAGGATTTTGTCATG 960 

ACACGGAATACCAAAATTAAAAAGGATACACATGGCCTTCATATGAACCGTGAACCrTTG 1020 

ATAACGTGGAAGTTCAAAGAAGGTAAAGTTTAAGAATAAACTGAC^ 1080 

ATTTGGCCCACTACTAAATTTGCTTTACTTTCTAACA 1140 

GAATGATArrCArmTCATCCCATAAGTTCAATTT^ 12 00 

CTGAAAAATGCTTGGCCATTCACAAAGTTTATCTTAGTTCCT 1260 

TTTMTTTGACATGTTATTTATATTAGATGATATAATCCATGACCCAATAGACA 1320 

TrAATATTGTAACTTTCTAATTGAGTGICTCTACATCTTATTCAATC 1380 
AAMTAAATTATTrTITCACATTCTAAAACTTTAAGCA 144 0 
TAAAAACAAAAAACGACTTATTTATAAATCAACAAAC 1500 

FIG. 12 
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TTTTCCAAATTAAATGCAGAAAATGCATAATTITATACTC 1560 
TTTAGCCTAACCAACGAATATTTGTAAACTCACAACTTGATTAAAAGGGATTTACAACAA, 1620 
GATATATATAAGTAGTGACAAATCTTGATTTTAAATATTTTAATTTGGAGGTCAAAATCT 1680 

TACCATAATCATTTGTATTTATAATTAAATTTTAAATATC 1740 
AACTTTTAAATATACGTATATACAAAATATAAAATTATTGGCGTTCATATTAGGTCAATA 1800 

AATCCTTAACTATATCTGCCTTACCACTAGGAGAAAGTAA^ I860 
CATGTATTATGTATACAAAAAGTCGATTAGATTACCTAAATAGAAATTGTATAACGAGTA 1920 

AGTAAGTAGAAATATAAAAAAACTACAATACTAAAAAAA^ 1980 
AAACTMTGGGGTCTGAGTGAAATATTCAGAAAGGGG^ 2040 
r.o»T v T v rT*rfrTT3^ & &&rv~r^rTAAAATGAGGAAATCAAGAATCAGAACATACAAGAAGGCA 2100 

GCAGCTGAAGCAAAGTACCATAATTTAATCMTGGAAATTAATTTC^AA 2160 

ME INFKVLSK 
ACCCATTCGAGGATCTTTTCCATCTTTCTCACCTAAAGTTTCTTCAGGGg taa 1 1 1 1 1 ac 2220 

P1RG SFPSFSP KV SSG _ 
taattccatgttaatttcaattacttttagcctttgcacttcactttccaacatatctgg 2280 

atcatctccttagtttttcacttcatttcttataatatcaaatatggaagaaaaatgaca 2340 

cttgtagagccatatgtaagtatcatgcgacaaatttgcaaggtggttgagcgtataaaa 2400 

ttcaaaaattgagagacggagggggggtgggggbaragacaacatttagaaagagtgtcc 2460 

taggaggttatggaggacacggatgaggggtagaaggttagttaggtacttgagtgttgc 2520 

ctggcttatcccttcatactagtagccgcggaattattcgggcagtttcctgttttgtta 2580 

tttgacctctgttattctattttctgtcccttgtacctcgattactgtattacacaccct 2640 

gccgtagctatcgttcctcggtaagaatgctctagcatgcttcccttagtgtcctatcac 2700 

gccttctttatattcgcgttgctttgaaatgcttttactttagccgagggtctaccagaa 2760 

acaatctctctatctcgtaaggcaggggtaaagccctcaccacactccacttgtgggact 2820 

acattgtgtttgttgttgtaaatcaattatgtatacataataagtggattttttacaaca 2880 

caaatacatggtcaagggcaaagttccgaacacataaagggttcattatatgtccaggga 2940 

tatgataaaaactgtttctttgtgaaagttatataagatttgttatggcttttgctggaa 3000 

FIG. 12 CONTINUED 
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acacaataagttataatgccgagatagccactgaagttcgtttcctctagccttttaaat 3060 
gcaccaataatagattccgcatcgaacgagcacgttttgattacccggtcatgacgttcc 3120 
tattccttacatttcttcggtgttgaactgcaatcgaaaatgctgtatcccatgagacgg 3180 
atagttgagaatgtgctctttgtatggaccttgagaagctcaaacgctactccaataatt 
tctatgaattcaaattcagtttacggctaccagtcagtccagaaattaggatatgctgca 
tacactcgtccaactacactgtaaaatttctcaagttctcaagatatccatgtaacctcg 



3240 
3300 
3360 



agaatttctttgacagGCTTCTAGAAATAAGATATCTTTTCCTTCT^ 3420 

ASRNKICFPSQHSTG 

ACTGAAGTITGGATCTCAGGAACGGTCTTGGGATATTTCTTCCACCCCAAAATC^ 3480 

LKFGS QERS WD ISST PKSRV 

TAGAAAAGATGAAAGGgtatgtctgataatttatatggctgcatggacagtatacaaaca J:><iu 

3600 



3660 
3720 



R K D E R 

gccggaaaacctctggactggcgcccatggcacacttgacccgtgcaccgtgtggagacg 

ccaaacatgtgttacttcgttccgccaatttataacacctcaacttgggaaagacagccc 

cttactcctgtgggcatttgttattcgaactacaatctttatgagcatggcgccttcaca 

ttaccaacttctttcatgtggcacataacagttttcagctccgccaacaccttccctcct 3780 

tttgatataaactaactgtggtgcattgcttgcbkJck^ 3840 

CGCTGTTTTGACCGATGACGACAATTCGACAATGG^ 3900 
AVLTDDDN ST MAPLEEDVKT 

TGAAAATATTGGCCTCCTAAATTTGGATCCAACTTT^ 

EN IGLLNLDPTLEPYLDHFR 
ACACAGAATGAAGAGATATGTGGATCAGAAAATGCTCATTGAAAAATATGAGGGA 

HRMKRYVDQKMLIEK YEG P L 
TGAGGAATTTGCTCAAGgtaacagccaaaagttgtgccttaggcagtttgaccttatccc 

E E F A Q G ' 
ggaagacgaattgcttatacctacctcgacctcgctagagaactctgcataccggggagt 

aagtagtggctccatttaggtggcacctggccattttttcgaccctttaaaaagctgttc 

gattgggtcctcaaaaaagtagacaaggtttttggagaagtgacacacccccggagtgcc 4260 

agtggcaaagcaaagattttcactaaggagattcaaaacataaaaaaagtatagacataa 4320 

agaagctgaggggattcaacatgtactatacaagcatcaaatatagccttaaagcaattc 4380 

tgtagaaataaagaaagtcttccttctgttgcttcacaatctccttctactaccatgagt 4440 

tactctttctgttcgaaatagcttccttaatatcaaattcatgatacctttgttgagact 

FIG. 12 CONTINUED 
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caccagctccttctcgcgcaaactgcccccctttcctgcagGTTATTTAAAATTTGGAp 

Y L K F G r 
CAAC^GGGAAGATGGTTGCATAGTCTATCGTGAATGGGCTCCTGCTGCTCAgtaggCCCC 

NREDGCIVYREWAPAAQ 
cgcctaccacaaaatagtagtctccaccatcacaacagacccccctatcaaagcacgatg 

ccgcagcatcattggcctccttacacgttctaactgctatcaaggccacgcttctaatca 

actcatccacaat gc agGGAAGCAGAAGTTATTGGCGATTTCAATGGATGGAACGGTTCT 4800 

EAEVIGDFNGWNGS 
AACCACATGATGGAGAAGGACCAGTTTGGTGTTTGGAGTATTAGAATTCCTGATGTTGAC 4860 

NHMMEKD QFGVWS IRI PDVD 
AGTAAGCCAGTCATTCCACACAACTCCAGAGTTAAGTTTCGTTTCAAACATGGTAATGGA 4920 



4560 
4620 
4680 
4740 



SKPVIPHNSRVKFRFKHGN 
GTGTGGGTAGATCGTATCCCTGCTTGGATAAAGTATGCCACTGCAGACGCCACAAAGTTT 4980 

V W V D R I P A W I K Y A T A D A T K F 
GCAGCACCATATGATGGTGTCTACTGGGACCCACCACCTTCAGAAAGgtCttgCCatCCa 

AAPYDGVYWDPPPSER 
taccttgaagctgaattttgaacaccatcatcacaggcacttcgattcatgttctcacca 



gtcttgttacgtaagacattccgaaatgcaaaagtcaaaacaactgcgtcttcactaact 
cggacttgatcccatactctttcccttaacaaaacgagtcaactctataagtgcttgaga 



5040 
5100 
5160 
5220 



acCtactactCcagcaattaaacagGTACCACTTCJ^TACCCTCGCCCTCCCAAACCCC 5280 

YHFKYPRPPKPR 
GAGCCCCACGAATCTATGAAGCACATGTCGGCATGAGCAGCTCTGAGCCACGTGTAAATT 5340 

APRIYEAHVGMSSSEPR VNS 
CGTATCGTGAGTTTGCAGATGATGTTTTACCTCGGATTAAGGCAAATAACTATAATACTG 5400 

YREFADDVLP R IKANNYNTV 
TCCAGTTGATGGCCATAATGGAACATTCTTACTATGGATCATTTGGATATCATGTTACAA 5460 

QLMA IMEHSYYGSFGYH V T M 
ACTITrrnXTGTGAGCAGTAGATATGGAAACCCGGAGGACCT 5520 

FFAVSSRYGNPEDLKYLIDK 
AAGCACATAGCTTGGGTTTACAGGTTCTGGTGGATGTAGTTCACAGTCATGCAAGCAATA 5580 

AHSLGLQVL VDVVHSHAS NN 
ATGTCACTGATGGCCTCAATGGCTTTGATATTGGCCAAGGTTCTCAAGAATCCTACT^ 5640 

VTDGLNGFDIGQGSQ ESYFH 
ATGC TGGAGAGC GAGGGTAC CAT AAGTTGTGGGATAGCAGGCTGTTCAACT ATGCCAATT 5700 

AGERG YH KLWDSRLFNY A NW 
GGGAGGTTCTTCGTTTC C l^UriTCCAACTTGAGGTGGTGGCTAGAAGAGTATAACTTTG 5760 

EVLRFLLSNLRWWLEEYN F D 
ACGGATTTCGATTTGATGGAATAACTTCTATGCTGTATGTTCATCATGGAA^ 5820 

GFRFDG ITSML.YVHKG INMG 
GATTrA<^GGAAACTATAATGAGTATTTCAGCGAGGCTAC\GATGTTGATGCTCrGGTCT 5880 

FTGNY NEYFS EATDVDAVV Y . 
ATTTAATGTTGGCCAATAATCTGATTCACAAGATTTTCCCAGATGCAAC^ 5940 

LMLANNL IHKI FPDATV I A E 
AAGATGTTTCTGGTATGCCGGGCCTTGGCCGGCCTGTTTCTGAGGGAGGAATTC 6000 

DVSGMPGLGRP VSEGGIGFV 

FIG. 12 CONTINUED 
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TTTACCGCCTGGCAATGGCAATCCCAGATAAGTGGATAGATTATTTAAAGAATAAGAATG 6060 
YRLAMAIPD KWIDYLKNKND 

ATGAAGATTGGTCCATGAAGGAAGTAACATCGAGTTTGACAAATAGGAGATATACAGAGA 6 120 
EDWSMKEVTSSLTNRRYTEK 

AGTGTATAGCATATGCGGAGACCCATGATCAGgtactttaaacttatttctacaaccaaa 

CIAYAE THDQ 
taattctcagaacaattgttagatagaatccaaatatatacgccctgaaagtacaaaagt 



6480 
6540 
6600 
6660 



6180 
6240 

acttactctcgccatgggccctcagaatactggtagccgctgaataccatgacaagttac 6300 

ctatccagcgacattcttatgctcacccccattatgcctgctggacacagTCTATTGTTG 6360 

S I V G 

GTGACAAGACCATTGCATTTCTCCTAATGGACAAAGAGATGTATTCTGGCATGTCTTGCT 6420 
DKTIAFLL MDKEMYSGMS CL 

TGACAGATGCTT CTC CTGTTGTTGATCGAGGAATTGCGCTTCACAAGg tttgtCtgCCLC 
TDASPVVDRGIALHK 

tattgcattctaaggttcatataggttagccacggaaaatctcactctctgcgaggtaac 

cagggttctgatggattactcaacttccccgtttatcactcgctcatcctcttcacgcat 
tgtgtttctttctcaatatccctctcatttggaggtaactttcctcatctatccaccttt 

agcCCCtaaccacagATGATCCATTTTTTCACAATGGCCTTGGGAGGAGAGGGGTACCTC 6720 

M I HFF TMALG OE.G Y L 
AAirrCATGGGTAACGAGgtatgtcttacatctctagatactctgtgataattacaatca 6780 

N F M G N E 

gtctggcttacttgaacaagactcattcctcaaaatgacctgaaccgttgaacatcaaag 

gggttgaaacatagaggaaaacaacatgatgaatgcctccattgtctagggatttctatc 

atgttgctgagaacaaatgtcatcttaaaaaaaacattgtttacttttttgtagtataga 

agattactgtatagagtttgcaagtgtgtctgttttggagtaattgtgaaatgtttgatg 

aact tgtacagTTTGGCCATCCTGAGTGGATTGACTTCCCTAGAGAGG 7080 

FG H PEW IDF P REGNNWS 
GTTATGACAAATGTAGACGCCAGTGGAACCTCGCGGATAGCGAACACTTGAGATACAAGg 7 140 

YDKCRRQWNLADSE HLRYK 
tccaagtactttgaatcgcagcctgttaaataatctagtaatttttagattgcttacttg 7^oo 

gaagtctacttggttctggggatgatagctcatttcaccctgtcctacttattttccaac 7260 

cgaatctctgacccttgtttcgagacccaagcattagatccacttacacctactaccgcc 7320 

tcatttctaccactaaggccttgatgagcagcttaagttgattctttgaagctatagttt 

caggctaccaatccacagcccgctacatttgttggatacttaccccttctttacaatgaa 

gtgatactaattgaaatggcctaaatccgatatctatattcccccgcccttcctccccct 
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cat c a t gaaac gc agTTT ATGAATGCATTTGATAGAGCTATGAATTCGCTCGATGAAAAG 7560 

FMNA FDRAMN SLDEK 
TTCTCATTCCTCGCATCAGGAAAACAGATAGTAAGCAGCATGGATGATGATAATAAGgca 
FSFLASGKQIVSSMDDDNK 
aaatcatctaaagttgaaagtgttgggtttatgaagtgccttaattctatccaaggacaa 



7620 
7680 

gtagaaacctttctaccttccatttcttgatgatggatttcatattacttaatccaacag 7740 



ctggtcaaattcggtaatagctgtactgattagttacttcatt^ 



7800 
7860 



TGAACGTGGTGACCTGGTATTTGTATTCAACTTCCACCCAAAGAACACATACGAAGGgta 

ERGDLVFVFNFHPKNTYEG 

tatatgttttacttatccacgaaattattgctctgcttgtttttaatgtactgaacaagc 7920 

tttatggagaagcaactgaaacaaatcattttcacattgtctaatctaacccttttttct 7980 

gat cc t cgca t gacgaaaac agGTATAAAGTTGGATGTGACTTGCCAGGGAAGTACAGAG 8040 

YKVGCDL PG KYRV 

TTGCACTGGACAGTGATGCTTGGGAATTTGGTGGCCATGGAAGAg t aagga 1 1 1 gc 1 1 ga 8100 

ALDSDAW EFGGHGR q1C a 

ataacttttgataataagataacagatgtagggtacagttctctcaccaaaaagaactgt 8i&u 

aattgtctcatccatctttagttgtataagatatccgactgtctgagttcggaagtgttt 8220 

gagcctcctgccctccccctgcgttgtttagctaattcaaaaaggagaaaactgtttatt 8280 

gatgatctttgtcttcatgctgacatacaatctgttctcatgacagACTOrr^T^^ 



TGACCATTTCACATCACCAGAAGGAATACCTGGAGTTCCAGA 

DHFTSPEGI P GVPETNFNGR 
TCCAAATTCCTTCAAAGTGCTGTCTCCT 

PNSF KVLSPARTCV ■ 
tgacctccctttttattgtggttttgttcatagttatttgaatgcgatagaagttaacta 8^u 



8340 
8400 
8460 



8580 
8640 
8700 



ttgattaccgccacaatcgccagttaagtcctctgaactactaatttgaaaggtaggaat 
agccgtaataaggtctacttttggcatcttactgttacaaaacaaaaggatgccaaaaaa 
attcttctctatcctctttttccctaaaccagtgcatgtagcttgcacctgcataaactt 
aggtaaatgatcaaaaatgaagttgatgggaacttaaaaccgccctgaagtaaagctagg 8760 
aatagtcatataatgtccacctttggtgtctgcgctaacatcaacaacaacatacctcgt 8820 
gtagtcccacaaagtggtttcagggggagggtagagtgtatgcaaaacttactcctatct 
cagaggtagagaggattttttcaatagacccttggctcaagaaaaaaagtccaaaaagaa 
gtaacagaagtgaaagcaacatgtgtagctaaagcgacccaacttgtttgggactgaagt 9000 
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. agttgttgttgttgaaacagtgcatgtagatgaacacatgtcagaaaatggacaacacag 9060 
ctatttcgcgcaagtcaaaaaaacgcaccactatttccttgtgcagcttcacgtatagaa 9120 
aagttaaacaaccaatgaattctgctagcagaaaaatagcctggagagaaactttttata 9180 
ttgaactaagccaactatattcacccttctttttgcttcttcttctccttgtttgtgaag 9240 

9300 
9360 



9600 
9660 



GCTTATTACAGAGTTGATGAACGCATGTCAGAAACTGAAGATTACCAGACAGACATTTGT 
AYYRVDERMSETEDYQTDIC 
AGTGAGCTACTACCAACAGCCAATATCGAGGAGAGTGACGAGAAACTTAAAGATTCGTTA 
SELLPTANIEESD EKLKDSL 
TCTACAAATATCAGTAACATTGACGAACGCATGTCAGAAACTGAAGTTTACCAGACAGAC 9420 
STNISNIDERMSETEVYQ TD 
ATTICrAGTGAGCTACTACCAACAGCCAATATTGAGGAGAGTGACGAGAAACTTAA^ 9480 
ISSELL PTANIEESDE KLKD 
TCGTTATCTACAAATATCAGTAACATTGATCAGACTGTTGTAGTTTCTGTTC 9540 
SLSTNISNIDQTVVVSVE E R 
GACAAGGAACTTAAAGATTCACCGTCTGTAAGCATCATTAGTGATC 
DKELKDSPSVSIISDVVPAE 
TGGGATGATTCAGATGCAAACGTCTGGGGTGAGGACTAGTCAGATGATTGATCGACCCT 

WDDSDANVWGED 
CTACCGATTGGTGATCGCTATCCTTGCTCTGTGAGAAATAGGTGAGGCGAAACAAAAAAT 9720 

AATTTGCATGATAAAAAGTCTGATTITATGATCGCrATCCTCGCTCTCTGAGA 9780 

GAAACAAAGGCGACTCCTGGACTCGAATCTATAAGATAACAAAGGCGACTCCTCGGACTC 9840 

GAATCTATAAGATAACAAAGGCAATTCCAAGACTTCAATCTATA 9900 

ATGATTAACGTCCGATCCTAATTCGAATCGAGGCATCTTACCACTC 9960 

TAAGTCAATAAGTCATATAAWAGTATTAAAAACTAAATTGACTTGATCGGTCT 10020 

ATMAGATMAAATT'GTGTTCATATCTAACATTIT^ 10080 

TTTCATGTGCAATAACAAAGAAATGATAGGAATTTAGAGATTCCAATTTTT^ 10140 

CAATTAACTTAATTACATCTTTCATTTGCAATAACAAAGAAATGATAGGAATTTAGAGAT 10200 

CCAGTGTCAATACACAACCTAGGCCAACATCGAAAGCATAACTGTAAACTCATGCATGAA 10260 

GAAAT CAGTCGTAAAAATGAAT AAATGC GACATAAAAACAAATTGCATGT ATCATT AATG 10320 

TGACTTAACTACAAGTAAAAATAAATTTAACAAATGTAACTTAACTACAAGTAAAAATAA 10380 

ATTGCTTCTATCATTAACAAACAAACAGAATTAAAAAGAAA^A 10440 

CGTCATTCGATAAAAAAAAATACCAAATTCATAATGCAAGGAAAACGAAACGCGTCCTGA 10500 
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10560 



CAAAAAAAAGAACACGATCCTTTGCACCCGTTCGATGATTATCAGTATGTTCACAAAAAA 10620 

AACITAAGTTCATCCCAGTGTACAACAGCCCCAACATCTGCCCCAAGTAA 10680 

CCAATTTATCTTATrCTTATCTGCCACAAAATAATCGG 10740 

ACAAAATTGAC^GTAGGAAGGAGAGGAGTCATCCAAATAAACGGTGCACOT 10800 
AAAAGTCTTATTTTTCGTAAGATCCAATTTCAACAAACTTTT^ 



10860 
10920 
10980 



TTGCTTGGATAACAAGTATTGCAAGGGGGGGACAGTAGCTATTAAGTTAGTCGGCCCAAG 

GAAATGGAGGAGTGATAGTCTCGAATATTATTCACCTC 11040 

TTAAGGAGTTACGTCTTTTACGCTCGCCAATTTCTTTTTTTAGA^ 11100 

TCGCGAGTTGTGGAAGGTTCAAGTTACTCGATTCGTGATTTTC^ HI 60 

GAGATTCGATATTTTCACGAGGTGTATTCGAGGTCTAGTAGAACGAAGG 11220 

GAAAGTTTCAAGAGTTCATCATCATCTTCTTCTAGTAGATTTTC 11280 

GAAAArrcrrccTcriTrcTATrGATTiTCTrc^ 11340 

ATTGAAAAGAAAGAAAATTTATAACAGAAAAAGATGTCAAAAAAAAGGTAA 11400 

GTATCATATACTTAAAGAGTTGCGTAGAGATAAGTCAAAAGAAACAGAATTATACTMTT 11460 

1 1 A. "70 

TCAGCTAAGTTAGAATTC 
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GTATACACTCTCTGGAGTTCGTTTTCCTACTGTTCCATCAGTGT 60 
YTLSG VRFPTVPSVYKSNGF 

Sspl 
Bsral 

CAGCAGTAATGGTGATCGGAGGAATC^^^TTTCTCT 120 
SSNGDRRNANISVFLKKH SL 



BsaAI 

TTCAjSgtatgtcccactgtgtttgtggctgcgtgtgtttttttctctgtctttttgtgtt 180 
S R 



Bspl286I 
Banll 



ttgtgtaattggggctctttaaagttggtattgtgtatacccttttgagtatagtctttg 



aggaagcaaaatgatgaatcttgattgacattagtaagggttgtaactttttgaagtttg 



agcaaaaatgatgaatcttgattggcattagtaaaggttgtagctttttgaagtgtggtt 



240 



300 



gttaggtgtaattgagtttggcttgtgtgtctgtgtgtcgaggttatttttttggtttgt 360 



gttattggggatcttaaaagttggtattgtgtatacccttttgagtatagtctttgagga 420 



480 



aggtgtaattgagtttggcttgtgtgtctgtgtgttttggaatcctgatgtgtgtcaagt 540 
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cctgatatgggtcgaggttctttccttggtttgtgtaactgggggttcttaaaagttggt 600 

Clal 
BSpDI 

attatgtacctttttaagaatagtgcctgagaaagcaaaatcgatgaattttgattgaca 660 
gcatattctttgagaaagcaaaaaatggtgagttttcatggagaaacttgattgacatta 720 



rtaaaggtagcaactttttcaactcctgatatgggtcaaggttctttgtttggtttgtgt 



780 



tttggggttctttgaagttttgagaaagaaaaattatgatttttcatggagaaatttg 840 



aai 



PvuII 

Asel NspBII 
atttaca?taataaaggtagtagctttttaaagtgtggtcagctgtaatgagttcagctt 900 

Bspl286I 
Banll 

Apal Ndel 

ggtttaaaggggcccctacatatggtgctttctggtgagatatttgttgctccaccatac 960 



gagttataagaatcatagtgttaggatcttttttcttttttttttcatttttcacttgac 1020 
tagctactagaggagtgatcttgacggcggaaaatcttagaaaggggaaggttgtttgca 1080 
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Esp3l BsaBI 
tcaactggtgttatatgt^caaggagacgggagatgatg^gatcatcttcttcttcatt 1140 



gtggtctttccatgaggttatgatgtgatatgtttgaatggcttggtacttcttggctat 



1200 



Earl 

1260 



gccaagaactgtgaaagaattgatattcagttggaagtgtggagttggaagagtggaaga 



attgacacttggttccattagctttaatgtgggtggtgtggagagagagagaaataggag 



1320 



EcoFV 

1380 



agcttttgagggggtagagttgagctttcctcagttgagaagtagcctttgatatctttt 



EcoRI Muni 

ttttttttttttgtacacccatagaattcccaattgtatagaagattgggtggagtttgt 1440 



agagaatcatcttttgtagtagattctttaccttttggtatatccattgtatacagccag 



1500 



Stul 

gcctttgactatgtttatgaatgaatatacattacttgaaaaaaaaagaagtgaagccag 1560 



tctgttgtacctttgtagacaatgttgttgcagcatcttgataattccctgaaaattgtc 1620 
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tgaaggaatagtttggttgacattgattatttcttggtttgtttaattcggtgttc 1680 



ttgaaggccattttaaatcctttgacattgttaaaggtgtttacaagtgttggtctgggt 



1740 



tta. 



aaagcacctcttgtatggtgctttctggagtgatctttcttcctccaaaagagaagt 1800 



Bell Bglll 

tgcaagaatcagtgtgtgtacttttttctcttgta^atcagatcttttttcaatttttc I860 



rgttttagttgatttatccatatagtgaaagttggtgtcatagttgctgtttgtggactt 



1920 



cc 



tgtaaaagttttttgatatacttaaaaaattgtcacacagaagaaagagttttttacc 1980 



Aflll 

attacttaagctagatgggactgtttgattcttagaccaaataatgaacctttttgttct 2040 



AfllH 

cttaacgtgtacttgaaatagtttggtaaaattgtgataggaaaaaagataattcttgat 2100 



Earl 

tgcttttggagcatcacttctaatcataaaagtctttgctctcttcaaccatgaatgata 2160 
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Sf Cl 
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